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HYPERSENSITIVE RESPONSE ELICITOR FROM 
ERWJNIA AMYLOVORA , ITS USE, AND ENCODING GENE 




_£}$fi~p> application claims benefit of U.S. Provisional Patent Application 

5 Serial No. 60/0S5#W, filed August 4, 1997. 



The present invention relates to a hypersensitive response elicitor from 
10 Erwinia amylovora, its use, and encoding gene. 

BACKGROUND OF THE INVENTION 

Interactions between bacterial pathogens and their plant hosts generally 
15 fall into two categories: (1) compatible (pathogen-host), leading to intercellular 

bacterial growth, symptom development, and disease development in the host plant; 
and (2) incompatible (pathogen-nonhost), resulting in the hypersensitive response, a 
particular type of incompatible interaction occurring, without progressive disease 
symptoms. During compatible interactions on host plants, bacterial populations 
20 increase dramatically and progressive symptoms occur. During incompatible 

interactions, bacterial populations do not increase, and progressive symptoms do not 



is associated with the active defense of plants against many pathogens (Kiraly, Z., 
25 "Defenses Triggered by the Invader: Hypersensitivity," pages 201-224 in: Plant 
Disease: An Advanced Treatise , Vol. 5, J.G. Horsfall and E.B. Cowling, ed. 
Academic Press New York (1980); Klement, Z., "Hypersensitivity," pages 149-177 
in: Phvtopatho genie Prokarvotes , Vol. 2, M.S. Mount and G.H. Lacy, ed. Academic 
Press, New York (1982)). The hypersensitive response elicited by bacteria is readily 
30 observed as a tissue collapse if high concentrations (> 10 7 cells/ml) of a limited 

host-range pathogen like Pseudomonas syringae or Erwinia amylovora are infiltrated 
into the leaves of nonhost plants (necrosis occurs only in isolated plant cells at lower 
levels of inoculum) (Klement, Z., "Rapid Detection of Pathogenicity of 
Phytopathogenic Pseudomonads," Nature 199:299-300; Klement, et al., 
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occur. 



The hypersensitive response ("HR") is a rapid, localized necrosis that 
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"Hypersensitive Reaction Induced by Phytopathogenic Bacteria in the Tobacco Leaf," 
Phytopathology 54:474-477 (1963); Turner, et ah, 'The Quantitative Relation 
Between Plant and Bacterial Cells Involved in the Hypersensitive Reaction," 
Phytopathology 64:885-890 (1974); Klement, Z., "Hypersensitivity," pages 149-177 
5 in Phytopathogenic Prokarvotes , Vol. 2., M.S. Mount and G.H. Lacy, ed. Academic 
Press, New York (1982)). The capacities to elicit the hypersensitive response in a 
nonhost and be pathogenic in a host appear linked. As noted by Klement, Z., 
"Hypersensitivity," pages 149-177 in Phytopathogenic Prokaryotes , Vol. 2., M.S. 
Mount and G.H. Lacy, ed. Academic Press, New York, these pathogens also cause 
10 physiologically similar, albeit delayed, necroses in their interactions with compatible 
hosts. Furthermore, the ability to produce the hypersensitive response or pathogenesis 
is dependent on a common set of genes, denoted hrp (Lindgren, P.B., et al., "Gene 
Cluster of Pseudomonas syringae pv. 'phaseolicola' Controls Pathogenicity of Bean 
Plants and Hypersensitivity on Nonhost Plants," J. BacterioL 168:512-22 (1986); 
15 Willis, D.K., et al., "hrp Genes of Phytopathogenic Bacteria," Mol. Plant-Microbe 
Interact . 4: 1 32- 1 38 (1 99 1 )). Consequently, the hypersensitive response may hold 
clues to both the nature of plant defense and the basis for bacterial pathogenicity. 

The hrp genes are widespread in gram-negative plant pathogens, where 
they are clustered, conserved, and in some cases interchangeable (Willis, D.K., et al., 
20 "hrp Genes of Phytopathogenic Bacteria," Mol. Plant-Microbe Interact . 4:132-138 

(1991); Bonas, U., "hrp Genes of Phytopathogenic Bacteria," pages 79-98 in: Current 
Topics in Microbiology and Immunology: Bacterial Pathogenesis of Plants and 
Animals - Molecular and Cellular Mechanisms , J.L. Dangl, ed. Springer- Verlag, 
Berlin (1994)). Several hrp genes encode components of a protein secretion pathway 
25 similar to one used by Yersinia, Shigella, and Salmonella spp. to secrete proteins 
essential in animal diseases (Van Gijsegem, et al., "Evolutionary Conservation of 
Pathogenicity Determinants Among Plant and Animal Pathogenic Bacteria," Trends 
Microbiol. 1:175-180 (1993)). InE. amylovora, P. syringae, and P. solanacearum, 
hrp genes have been shown to control the production and secretion of glycine-rich, 
30 protein elicitors of the hypersensitive response (He, S.Y., et al. "Pseudomonas 

Syringae pv. Syringae Harpin Pss : a Protein that is Secreted via the Hrp Pathway and 
Elicits the Hypersensitive Response in Plants," Cell 73:1255-1266 (1993), Wei, Z.-H., 
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et al., "HrpI of Erwinia amylovora Functions in Secretion of Harpin and is a Member 
ofaNew Protein Family," J. Bacteriol. 175:7958-7967 (1993); Arlat, M. et al. 
"PopAl, a Protein Which Induces a Hypersensitive-like Response on Specific Petunia 
Genotypes, is Secreted via the Hrp Pathway of Pseudomonas solanacearum," EMBO 

5 J. 13:543-553 (1994)). 

The first of these proteins was discovered in E. amylovora Ea321, a 
bacterium that causes fire blight of rosaceous plants, and was designated harpin (Wei, 
Z.-M., et al, "Harpin, Elicitor of the Hypersensitive Response Produced by the Plant 
Pathogen Erwinia amylovora," Science 257:85-88 (1992)). Mutations in the encoding 

10 hrpN gene revealed that the hypersensitive response elicitor is required for E 

amylovora to elicit a hypersensitive response in nonhost tobacco leaves and incite 
disease symptoms in highly susceptible pear fruit. The P. solanacearum GMI1000 
PopAl protein has similar physical properties and also elicits the hypersensitive 
response in leaves of tobacco, which is not a host of that strain (Arlat, et al. "PopAl , a 

15 Protein Which Induces a Hypersensitive-like Response on Specific Petunia 

Genotypes, is Secreted via the Hrp Pathway of Pseudomonas solanacearum," EMBO 
J 13:543-53 (1994))- However, P. solanacearum popA mutants still elicit the 
hypersensitive response in tobacco and incite disease in tomato. Thus, the role of 
these glycine-rich hypersensitive response elicitors can vary widely among 

20 gram-negative plant pathogens. 

Other plant pathogenic hypersensitive response elicitors have been 
isolated and their encoding genes have been cloned and sequenced. These include: 
Erwinia chrysanthemi (Bauer, et. al., "Erwinia chrysanthemi Harpin Ech : Soft-Rot 
Pathogenesis," MPMI 8(4): 484-91 (1995)); Erwinia carotovora (Cui, et. al., "The 

25 RsmA" Mutants of Erwinia carotovora subsp. carotovora Strain Ecc71 Overexpress 
/irpN Ecc and Elicit a Hypersensitive Reaction-like Response in Tobacco Leaves," 
MPMI 9(7) : 565-73 (1966)); Erwinia stewartii (Ahmad, et. al., "Harpin is not 
Necessary for the Pathogenicity of Erwinia stewartii on Maize," 8th Tnfl. Cong. 
Mow. Pl«nt-Microb. Inter. July 14-19, 1996 and Ahmad, et. al., "Harpin is not 

30 Necessary for the Pathogenicity of Erwinia stewartii on Maize," Ann. Mtg. Am. 
Phvtopath. Soc. July 27-31, 1996); and Pseudomonas syringae pv. syringae (WO 
94/26782 to Cornell Research Foundation, Inc.). 
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The present invention is a further advance in the effort to identify, 
clone, and sequence hypersensitive response elicitor proteins or polypeptides from 
plant pathogens. 

5 SUMMARY OF THE INVENTION 



The present invention is directed to an isolated protein or polypeptide 
which elicits a hypersensitive response in plants as well as an isolated DNA molecule 
which encodes the hypersensitive response eliciting protein or polypeptide. 

1 0 The hypersensitive response eliciting protein or polypeptide can be 

used to impart disease resistance to plants, to enhance plant growth, and/or to control 
insects. This involves applying the hypersensitive response elicitor protein or 
polypeptide in a non-infectious form to plants or plant seeds under conditions 
effective to impart disease resistance, to enhance plant growth, and/or to control 

1 5 insects on plants or plants grown from the plant seeds. 

As an alternative to applying the hypersensitive response elicitor 
protein or polypeptide to plants or plant seeds in order to impart disease resistance, to 
enhance plant growth, and/or to control insects on plants, transgenic plants or plant 
seeds can be utilized. When utilizing transgenic plants, this involves providing a 

20 transgenic plant transformed with a DNA molecule engoding a hypersensitive 
response elicitor protein or polypeptide and growing the plant under conditions 
effective to impart disease resistance, to enhance plant growth, and/or to control 
insects in the plants or plants grown from the plant seeds. Alternatively, a transgenic 
plant seed transformed with the DNA molecule encoding a hypersensitive response 

25 elicitor protein or polypeptide can be provided and planted in soiL A plant is then 
propagated under conditions effective to impart disease resistance, to enhance plant 
growth, and/or to control insects on plants or plants grown from the plant seeds. 



30 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A-D show mutagenesis, complementation and heterologous 
expression constructs, and homology with and complementation of mutants by the 
avrE locus of P. syringae for the dspE operon of E. amylovora. Dashed boxes are 
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uncharacterized ORFs; a filled triangle indicates a hrp (i.e. hypersensitive response 
elicitor encoding gene); box is a regulatory sequence that preceeds many hrp genes; 
and an open triangle indicates another promoter. Thick lines delineate the DNA for 
which sequence was accessioned. Figure 1 A shows the dsp/hrp gene cluster of 
E. amylovora in pCPP430. Operon names and types of proteins encoded are indicated 
at the top. B, BamYil; E, EcoRl; H, Hindlll. Half-arrows indicate internal promoters 
without similarity to the hrp box consensus. Figure IB shows the region downstream 
of hrpH containing the dspE operon. Circles mark deletion mutations and 
representative transposon insertions: black, non-pathogenic and HR + (i.e. 
hypersensitive response eliciting) or HR reduced (dsp); gTay, reduced virulence and 
HR; white, wild-type. T104 lies within the area marked by the dashed double arrow. 
K, 7w5miniKm; P, 7>?5phoA; T, TVilOtef; A, deletion mutation. The gray box is A/T- 
rich DNA. Figure 1C shows the clones and subclones of the dspE operon. Plasmid 
designations are indicated at the left, and vector-borne promoters at the right. 
Restriction sites used for subcloning not shown above are shown in parentheses. A 
"+" aligned with a circle representing a mutation in B indicates that the subclone 
complements that mutation. Figure ID shows the avrE locus (transcription units III 
and IV) of P. syringae pv. tomato DC3000 in pCPP2357. Percent amino acid identity 
of the predicted proteins AvrE and AvrF to DspE and DspF, respectively, are 
indicated. Black rectangles are transcriptional terminators (inverted repeats). 
Complementation of mutations shown in Figure IB are depicted as in Figure 1C, 
above. 

Figure 2 shows the expression of the full-length and the N-terminal 
half of DspE in recombinant E. coli DH5oc. Lysates of cells carrying either 
pCPP1259, containing the entire dspE operon (lane A); pCPP50, the cloning vector 
(lane B); or pCPP1244, containing only the 5' half of the dspE gene (lane C), were 
subjected to SDS-PAGE followed by Coomassie staining. Bands corresponding to 
DspE (lane A) and the N-terminal half of DspE (lane C) are marked by arrows. 
Migration of molecular weight markers is indicated on the left. 

Figures 3A-D show the role of dspe in pathogenicity and HR 
elicitation. Figure 3 A shows immature pear fruit 4 days after inoculation with (left to 
right) strains Ea321, Ea321<Zsp£M554, or Ea321<fcp£A1554 harboring the 5' half of 
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dspE on pCPP 1237. Figure 3B shows Norchief soybean leaf 24 hr after infiltration 
with (1) Ea321, (2) Ea321dsp£A1554, (3) Ea321hrpN::Tn5 (Wei, et al., Science , 
257:85-88 (1992), which is hereby incorporated by reference), and 
(4) Ea321hrpL::7>?5 (Wei, et al. s J. BacterioL , 177:6201-10 (1995), which is hereby 
5 incorporated by reference). Figure 3C shows a tobacco leaf 48 hr after infiltration 
with parallel dilution series of suspensions of strains (left) Ea321 and (right) 
Ea321<fcp£A1554. The concentrations infiltrated (top to bottom) are 1 x 10 10 , 1 x 10 9 , 
5 x 10 s , and 5 x 10 7 cfu/ml. Figure 3D is like Figure 3C except the more virulent 
strain Ea273 and corresponding mutant Ea273^p£A1554 were used, and 

10 concentrations ranged from 5 x 10 9 to 5 x 10 5 cfu/ml in log increments. 

Figure 4 shows the expression of a promoterless GUS construct fused 
to dspE in E. amylovora Ea273. Ea273 and Ea273dspE::uidA (a merodiploid 
containing both a wild-type dspE and a truncated dspE fused to the uidA gene; black 
bars) were grown in LB or Hxp MM, or inoculated to immature pear fruit. 

15 Ea273dspE:;uidAhrpL::Tn5 (dark gray bar) and Ed273hrcV::Tn5uidA (light gray bar) 
were also grown in hrp MM. Values shown represent means of triplicate samples 
normalized for bacterial cell concentration. Standard deviations are represented by 
lines extending from each bar. The mean values for three samples of Ea273 in each 
assay were subtracted from, and standard deviations added to, the corresponding 

20 values obtained for the other strains. 

Figures 5A-C show the transgeneric avirulence function of the dspE 
operon and complementation of a dspE mutant with the avrE locus. Norchief soybean 
leaves were either (See Figure 5 A) infiltrated with 1 x 10 s cfu/ml suspensions of (left) 
P. syringae pv. glycinea race 4 carrying pCPP1250 (containing the dspE operon) or 

25 (right) pML 122 (the cloning vector) and photographed after 24 hr at room 

temperature or (See Figure 5B) infiltrated with 8 x 10 5 cfu/ml suspensions of the same 
strains and photographed after seven days at 22° C and high relative humidity. Tissue 
collapse is apparent on both leaves where the strain carrying pCPP1250 was 
infiltrated. On the leaf incubated for seven days, chlorosis extending beyond the 

30 infiltrated area, typical of disease, is apparent on the half infiltrated with the strain 
carrying the vector only. The dark section on the side of the leaf infiltrated with the 
strain carrying pCPP1250 is a shadow caused by a buckle in the leaf. Figure 5C 
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shows pear halves inoculated with (left to right) Ea321 , Ea321dy/?£A1521(pCPP2357, 
containing the avrE locus), or Ea321^/?£A1521(pCPP2357 'avrE: :Tn5uidA) and 
photographed after seven days. Although symptoms are greatly reduced relative to 
wild type, necrosis is apparent around and drops of ooze can be seen within the well 
of the fruit inoculated with the dspE strain carrying the intact avrE locus. The fruit 
inoculated with the dspE strain carrying a disrupted clone of avrE is symptomless. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an isolated DNA molecule having a 
nucleotide sequence of SEQ. ID. No. 1 as follows: 

ATGGAATTAA AATCACTGGG AACTGAACAC AAGGCGGCAG TACACACAGC GG CGCACAAC 6 0 

CCTGTGGGGC ATGGTGTTGC CTTACAG CAG GGCAGCAGCA GCAGCAGCCC GCAAAATGCC 12 0 

GCTGCATCAT TGGCGGCAGA AGGCAAAAAT CGTGGGAAAA TGCCGAGAAT TCACCAGCCA 18 0 

TCTACTGCGG CTGATGGTAT CAGCGCTGCT CACCAGCAAA AGAAATCCTT CAGTCTCAGG 24 0 

GGCTGTTTGG GGACGAAAAA ATTTTCCAGA TCGGCACCGC AGGGCCAGCC AGGTACCACC 3 00 

CACAGCAAAG GGGCAACATT GCGCGATCTG CTGGCGCGGG ACGACGGCGA AACG CAG CAT 3 60 

GAGGCGGCCG CGCCAGATGC GGCG CGTTTG ACCCGTTCGG GCGGCGTCAA ACGCCGCAAT 42 0 

ATGGACGACA TGGCCGGGCG GCCAATGGTG AAAGGTGGCA GCGGCGAAGA TAAGGTACCA 48 0 

ACGCAGCAAA AACGG CAT CA G C TGAACAAT TTTGGCCAGA TGCGCCAAAC GATGTTGAGC 54 0 

AAAATGGCTC ACCCGGCTTC AGCCAACGCC GGCGATCGCC TGCAGCATTC ACCGCCGCAC 6 00 

ATCCCGGGTA GCCACCACGA AATCAAGGAA GAACCGGTTG GCTCCACCAG CAAGGCAACA 660 

ACGGCCCACG CAGACAGAGT GGAAATCGCT CAGGAAGATG ACGACAGCGA ATT C CAG C AA 72 0 

CTGCATCAAC AGCGGCTGGC GCGCG AACGG GAAAATCCAC CGCAGCCGCC CAAACTCGGC 780 

GTTGCCACAC CGATTAGCGC CAGGTTTCAG C C CAAACTG A CTGCGGTTGC GGAAAGCGTC 84 0 

CTTGAGGGGA CAGATACCAC GCAGTCACCC CTTAAGCCGC AATCAATGCT GAAAGGAAGT 900 

GG AG C CGGGG TAACGCCGCT GGCGGTAACG CTGGATAAAG GCAAGTTGCA GCTGGCACCG 960 

GATAATCCAC CCGCGCTCAA TACGTTGTTG AAG CAG A CAT TGGGTAAAGA CACCCAGCAC 1020 

TATCTGGCGC ACCATGCCAG CAGCGACGGT AGCCAGCATC TGCTGCTGGA CAACAAAGGC 10 8 0 

CACCTGTTTG AT AT CAAAAG CACCGCCACC AGCTATAGCG TGCTGCACAA CAGCCACCCC 114 0 

GGTGAGATAA AGGGCAAGCT GGCGCAGGCG GGTACTGGCT CCGTCAGCGT AGACGGTAAA 12 00 



1260 
1320 
1380 
1440 
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AGCGGCAAGA TCTCGCTGGG GAG CGGTACG CAAAGTCACA ACAAAACAAT GCTAAGCCAA 
CCGGGGGAAG CGCACCGTTC CTTATTAACC GGCATTTGGC AGCATCCTGC TGGCGCAGCG 
5 CGGCCGCAGG GCGAGTCAAT CCGCCTGCAT GACGACAAAA TTCATATCCT GCATCCGGAG 
CTGGGCGTAT GGCAATCTGC GGATAAAGAT ACCCACAGCC AGCTGTCTCG CCAGGCAGAC 

^ GGTAAGCTCT ATGCGCTGAA AGACAACCGT ACCCTGCAAA ACCTCTCCGA TAATAAATCC 1 50 ( 

TCAGAAAAGC TGGTCGATAA AATCAAATCG TATTCCGTTG ATCAGCGGGG GCAGGTGGCG 156C 

ATCCTGACGG ATACTCCCGG CCGCCATAAG ATGAGTATTA TGCCCTCGCT GGATGCTTCC 162C 

15 CCGGAGAGCC ATATTTCCCT CAGCCTGCAT TTTGCCGATG CCCACCAGGG GTTATTGCAC 16 8 C 

GGGAAGTCGG AGCTTGAGGC ACAATCTGTC GCGATCAGCC ATGGGCGACT GGTTGTGGCC 17 4D 

GATAGCGAAG GCAAGCTGTT TAGCGCCGCC ATTCCGAAGC AAGGGGATGG AAACGAACTG 1800 

AAAATGAAAG CCATGCCTCA GCATGCGCTC GATGAACATT TTGGTCATGA CCACCAGATT 18 60 

TCTGGATTTT TCCATGACGA CCACGGCCAG CTTAATGCGC TGGTGAAAAA TAACTTCAGG 192 0 

25 CAGCAGCATG CCTGCCCGTT GGGTAACGAT CATCAGTTTC ACCCCGGCTG GAACCTGACT 198 0 

GATGCGCTGG TTATCGACAA TCAGCTGGGG CTGCATCATA CCAATCCTGA ACCGCATGAG 2 04 0 

ATTCTTGATA TGGGGCATTT AGGCAGCCTG GCGTTACAGG AGGGCAAGCT TCACTATTTT 2100 

GACCAGCTGA CCAAAGGGTG GACTGGCGCG GAGTCAGATT GTAAGCAGCT GAAAAAAGGC 2160 

CTGGATGGAG CAGCTTATCT ACTGAAAGAC GGTGAAGTGA AACGCCTGAA TATTAATCAG 22 2 0 

35 AGCACCTCCT CTATCAAGCA CGGAACGGAA AACGTTTTTT CGCTGCCGCA TGTGCGCAAT 228 0 

AAACCGGAGC CGGGAGATGC CCTGCAAGGG CTGAATAAAG ACGATAAGGC CCAGGCCATG 23 4 0 

4o G CGGTGATTG GGGTAAATAA ATACCTGGCG CTGACGGAAA AAGGGGACAT TCGCTCCTTC 24 0 0 

CAGATAAAAC CCGGCACCCA GCAGTTGGAG CGGCCGGCAC AAACTCTCAG CCGCGAAGGT 24 60 

ATCAGCGGCG AACTGAAAGA CATTCATGTC GACCACAAGC AGAACCTGTA TGCCTTGACC 252 0 

45 CACGAGGGAG AGGTGTTTCA TCAGCCGCGT GAAGCCTGGC AGAATGGTGC CGAAAGCAGC 2 5 80 

AGCTGGCACA AACTGGCGTT GCCACAGAGT GAAAGTAAGC TAAAAAGTCT GGACATGAGC 264 0 

CATGAGCACA AACCGATTGC CACCTTTGAA GACGGTAGCC AGCATCAGCT GAAGGCTGGC 270 0 

GGCTGGCACG CCTATGCGGC ACCTGAACGC GGGCCGCTGG CGGTGGGTAC CAG CGGTTCA 27 6 0 

CAAACCGTCT TTAACCGACT AATGCAGGGG GTGAAAGGCA AGGTGATCCC AGGCAGCGGG 2 820 

55 TTGACGGTTA AGCTCTCGGC TCAGACGGGG GGAATGACCG GCGCCGAAGG GCGCAAGGTC 2880 

AGCAGTAAAT TTTCCGAAAG GATCCGCGCC TATGCGTTCA ACCCAACAAT GTCCACGCCG 294 0 

6q CGACCGATTA AAAATGCTGC TTATGCCACA CAGCACGGCT GGCAGGGGCG TGAGGGGTTG 3 0 00 

AAGCCGTTGT ACGAGATGCA GGGAGCGCTG ATTAAACAAC TGGATGCGCA TAACGTTCGT 3 060 
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CATAACGCGC CACAGCCAGA TTTGCAGAGC AAACTGGAAA CTCTGGATTT AGGCGAACAT 3120 

GGCGCAGAAT TGCTTAACGA CATGAAGCGC TTCCGCGACG AACTGGAGCA GAGTGCAACC 3180 

5 CGTTCGGTGA CCGTTTTAGG T CAACAT CAG GGAGTGCTAA AAAG CAACGG TGAAATCAAT 3 24 0 

AGCGAATTTA AGCCATCGCC CGGCAAGGCG TTGGTCCAGA GCTTTAACGT CAATCGCTCT 3 3 00 

GG T CAG GAT C TAAGCAAGTC ACTGCAACAG GCAGTACATG CCACGCCGCC AT C CG CAG AG 3 3 60 

10 

AGTAAACTGC AATCCATGCT GGGGCACTTT GTCAGTGCCG GGGTGGATAT GAG T CAT CAG 3 420 

AAGGG CG AG A TCCCGCTGGG CCGCCAGCGC GATCCGAATG AT AAAAC CG C ACTG AC CAAA 34 8 0 

15 TCGCGTTTAA TTTTAGATAC CGTGACCATC GGTGAACTGC ATGAACTGGC CGATAAGGCG 3 54 0 

AAACTGGTAT C TG AC CAT AA ACCCGATGCC GAT C AG AT AA AACAGCTGCG CCAGCAGTTC 3 600 

GATACGCTGC GTGAAAAGCG GTATGAGAGC AATCCGGTGA AG CATTACAC CGATATGGGC 3 66 0 

20 

TTCACCCATA ATAAGGCGCT GGAAGCAAAC TATGATGCGG TCAAAGCCTT TATCAATGCC 3 72 0 

TT T AAG AAAG AGCACCACGG CGTCAATCTG ACCACGCGTA CCGTACTGGA ATCACAGGGC 3 7 80 

25 AGTGCGG AG C TGGCGAAGAA GCTCAAGAAT ACGCTGTTGT CCCTGGACAG TGGTGAAAGT 3 84 0 

ATGAGCTTCA G C CGG T CAT A TGGCGGGGGC GTCAGCACTG TCTTTGTGCC TAC C C TT AG C 3 9 00 

AAGAAGGTGC CAGTTCCGGT GATCCCCGGA GCCGGCATCA CGCTGGATCG CGCCTATAAC 3 96 0 

30 

CTGAGCTTCA GTCGTACCAG CGGCGGATTG AACGTCAGTT TTGGCCGCGA CGGCGGGGTG 4 02 0 

AG TGG TAACA TCATGGTCGC TAC CGG C CAT GATGTGATGC CCTATATGAC CGG T AAG AAA 4 080 

35 AC CAG TG CAG GTAACGCCAG TGACTGGTTG AG CG C AAAAC AT AAAAT CAG CCCGGACTTG 414 0 

CGTATCGGCG CTGCTGTGAG TGGCACCCTG CAAGGAACGC TACAAAACAG CCTGAAGTTT 420 0 

AAGCTGACAG AGGATGAGCT GCCTGGCTTT ATCCATGGCT TGACGCATGG CACGTTGACC 4 2 60 

40 

C CGG CAG AAC TGTTGCAAAA GGGGATCGAA CAT CAG ATG A AG CAG GG CAG CAAACTGACG 43 20 

TTTAGCGTCG AT AC CT CGG C AAATCTGGAT CTGCGTGCCG GTATCAATCT GAACGAAGAC 43 8 0 

45 GGCAGTAAAC CAAATGGTGT CACTGCCCGT GTTTCTGCCG GGCTAAGTGC AT CGG CAAAC 44 4 0 

CTGGCCGCCG GCTCGCGTGA ACGCAGCACC ACCTCTGGCC AG TT TGG CAG CACGACTTCG 4 500 

GCCAGCAATA ACCGCCCAAC CTTCCTCAAC GGGGTCGGCG CGGGTGCTAA CCTGACGGCT 4 560 

50 GCTTTAGGGG TTGCCCATTC ATCTACGCAT GAAGGGAAAC CGGTCGGGAT CTTCCCGGCA 4 62 0 

TTTACCTCGA CCAATGTTTC GG CAG CG CTG GCGCTGGATA ACCGTACCTC ACAGAGTATC 46 8 0 

55 AGCCTGGAAT TGAAGCGCGC GGAGCCGGTG AC CAG CAACG AT AT CAG CG A GTTGACCTCC 4740 

ACGCTGGGAA AACACTTTAA GG AT AG CG C C ACAACGAAGA TGCTTGCCGC T CT CAAAG AG 4 8 00 

TTAGATGACG CTAAGCCCGC TGAACAACTG CATATTTTAC AG CAG CATTT CAGTGCAAAA 4 86 0 

60 GATGTCGTCG GTGATGAACG CTACGAGGCG GTGCGCAACC TGAAAAAACT GGTGATACGT 4 92 0 
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PA APAGGOTG 


CGGACAGCCA 


CAG CATGG AA 


TTAGGATCTG 


CCAGTCACAG 


CACGACCTAC 


4980 


A ATA ATPTGT 


CGAGAATAAA 


TAATGACGGC 


AT TG T CG AG C 


TGCTACACAA 


ACATTTCGAT 


5040 


nrnnrATTAC 


CAGCAAGCAG 


TGCCAAACGT 


CTTGGTGAAA 


TGATGAATAA 


CGATCCGGCA 


5100 


TTfiAAAGATA 

X VjAriiiVjri x -£ * 


TTATTAAGCA 


GCTGCAAAGT 


ACGCCGTTCA 


GCAGCGCCAG 


CGTGTCGATG 


5160 


HAG C TG AAAG 


ATGGTCTGCG 


TGAGCAGACG 


GAAAAAGCAA 


TACTGGACGG 


TAAGGTCGGT 


5220 


CGTGAAGAAG 


TGGGAGTACT 


TTTCCAGGAT 


CGTAACAACT 


TGCGTGTTAA 


ATCGGTCAGC 


5280 




CCGTCAGCAA 


AAGCGAAGGC 


TTCAATACCC 


CAG CG CTGTT 


ACTGGGGACG 


5340 




CTGCTATGAG 


CATGGAGCGC 


AACATCGGAA 


CCATTAATTT 


TAAATACGGC 


5400 


PAnGATCAGA 


ACACCCCACG 


GCGATTTACC 


CTGGAGGGTG 


GAATAGCTCA 


GGCTAATCCG 


5460 


CAGGTCGCAT 


CTGCGCTTAC 




AAGGAAGGGC 


TGGAAATGAA 


GAGCTAA 


5517 



This DNA molecule is known as the dspE gene. This isolated DNA molecule of the 
present invention encodes a protein or polypeptide which elicits a plant pathogen's 
hypersensitive response having an amino acid sequence of SEQ. ID. No. 2 as follows: 

Met Glu Leu Lys Ser Leu Gly Thr Glu His Lys Ala Ala Val His Thr 
! 5 10 15 

Ala Ala His Asn Pro Val Gly His Gly Val Ala Leu Gin Gin Gly Ser 
20 25 30 

Ser Ser Ser Ser Pro Gin Asn Ala Ala Ala Ser Leu Ala Ala Glu Gly 
35 40 45 

Lys Asn Arg Gly Lys Met Pro Arg He His Gin Pro Ser Thr Ala Ala 
50 55 60 



Asp Gly He Se 



Ala Ala His Gin Gin Lys Lys Ser Phe Ser Leu Arg 



65 70 75 80 

Gly Cys Leu Gly Thr Lys Lys Phe Ser Arg Ser Ala Pro Gin Gly Gin 
85 90 95 

Pro Gly Thr Thr His Ser Lys Gly Ala Thr Leu Arg Asp Leu Leu Ala 
100 105 HO 

Arq Asp Asp Gly Glu Thr Gin His Glu Ala Ala Ala Pro Asp Ala Ala 
115 120 125 

Arg Leu Thr Arg Ser Gly Gly Val Lys Arg Arg Asn Met Asp Asp Met 
130 135 140 

Ala Gly Arg Pro Met Val Lys Gly Gly Ser Gly Glu Asp Lys Val Pro 
145 150 155 160 

Thr Gin Gin Lys Arg His Gin Leu Asn Asn Phe Gly Gin Met Arg Gin 
165 170 175 



10 



20 



25 



35 



40 



50 



55 
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Thr Met Leu Ser Lys Met Ala His Pro Ala Ser Ala Asn Ala Gly Asp 
180 185 190 

Arg Leu Gin His Ser Pro Pro His lie Pro Gly Ser His His Glu He 
195 200 205 

Lys Glu Glu Pro Val Gly Ser Thr Ser Lys Ala Thr Thr Ala His Ala 
210 2 15 220 

Asp Arg Val Glu He Ala Gin Glu Asp Asp Asp Ser Glu Phe Gin Gin 
225 230 235 240 

Leu His Gin Gin Arg Leu Ala Arg Glu Arg Glu Asn Pro Pro Gin Pro 
245 250 255 

Pro Lys Leu Gly Val Ala Thr Pro lie Ser Ala Arg Phe Gin Pro Lys 
260 265 



270 



Leu Thr Ala Val Ala Glu Ser Val Leu Glu Gly Thr Asp Thr Thr Gin 
275 280 285 

Ser Pro Leu Lys Pro Gin Ser Met Leu Lys Gly Ser Gly Ala Gly Val 
290 295 300 

Thr Pro Leu Ala Val Thr Leu Asp Lys Gly Lys Leu Gin Leu Ala Pro 
305 310 315 320 

Asp Asn Pro Pro Ala Leu Asn Thr Leu Leu Lys Gin Thr Leu Gly Lys 
30 325 330 335 

Asp Thr Gin His Tyr Leu Ala His His Ala Ser Ser Asp Gly Ser Gin 
340 345 350 

His Leu Leu Leu Asp Asn Lys Gly His Leu Phe Asp He Lys Ser Thr 
355 3 6 o 365 

Ala Thr Ser Tyr Ser Val Leu His Asn Ser His Pro Gly Glu He Lys 
370 375 380 

Gly Lys Leu Ala Gin Ala Gly Thr Gly Ser Val Ser Val Asp Gly Lys 

385 390 395 * * 

400 

Ser Gly Lys He Ser Leu Gly Ser Gly Thr Gin Ser His Asn Lys Thr 
45 405 410 415 

Met Leu Ser Gin Pro Gly Glu Ala His Arg Ser Leu Leu Thr Gly He 
420 425 43Q 

Trp Gin His Pro Ala Gly Ala Ala Arg Pro Gin Gly Glu Ser He Arq 
43 5 440 445 

Leu His Asp Asp Lys He His He Leu His Pro Glu Leu Gly Val Trn 
450 455 460 V 

Gin Ser Ala Asp Lys Asp Thr His Ser Gin Leu Ser Arg Gin Ala Asp 

475 

Gly Lys Leu Tyr Ala Leu Lys Asp Asn Arg Thr Leu Gin Asn Leu Ser 
485 490 495 

Asp Asn Lys Ser Ser Glu Lys Leu Val Asp Lys He Lys Ser Tyr Ser 



505 510 
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Val Asp Gin Arg Gly Gin Val Ala He Leu Thr Asp Thr Pro Gly Arg 
515 520 525 

His Lys Met Ser He Met Pro Ser Leu Asp Ala Ser Pro Glu Ser His 
530 535 540 

He Ser Leu Ser Leu His Phe Ala Asp Ala His Gin Gly Leu Leu His 
545 550 555 5 60 

Gly Lys Ser Glu Leu Glu Ala Gin Ser Val Ala He Ser His Gly Arg 
565 570 575 

Leu Val Val Ala Asp Ser Glu Gly Lys Leu Phe Ser Ala Ala He Pro 
580 585 590 

Lys Gin Gly Asp Gly Asn Glu Leu Lys Met Lys Ala Met Pro Gin His 
595 600 605 

Ala Leu Asp Glu His Phe Gly His Asp His Gin He Ser Gly Phe Phe 
610 615 620 

His Asp Asp His Gly Gin Leu Asn Ala Leu Val Lys Asn Asn Phe Arg 
^25 630 635 640 

Gin Gin His Ala Cys Pro Leu Gly Asn Asp His Gin Phe His Pro Gly 
645 650 655 

Trp Asn Leu Thr Asp Ala Leu Val He Asp Asn Gin Leu Gly Leu His 
660 665 670 

His Thr Asn Pro Glu Pro His Glu He Leu Asp Met Gly His Leu Gly 
675 680 685 

Ser Leu Ala Leu Gin Glu Gly Lys Leu His Tyr Phe Asp Gin Leu Thr 
690 695 700 

Lys Gly Trp Thr Gly Ala Glu Ser Asp Cys Lys Gin Leu Lys Lys Gly 
705 710 715 720 

Leu Asp Gly Ala Ala Tyr Leu Leu Lys Asp Gly Glu Val Lys Arg Leu 
725 730 735 

Asn He Asn Gin Ser Thr Ser Ser He Lys His Gly Thr Glu Asn Val 
740 745 750 

Phe Ser Leu Pro His Val Arg Asn Lys Pro Glu Pro Gly Asp Ala Leu 
755 760 765 

Gin Gly Leu Asn Lys Asp Asp Lys Ala Gin Ala Met Ala Val He Gly 
770 775 780 

Val Asn Lys Tyr Leu Ala Leu Thr Glu Lys Gly Asp He Arg Ser Phe 
785. 790 795 8 00 

Gin He Lys Pro Gly Thr Gin Gin Leu Glu Arg Pro Ala Gin Thr Leu 
805 810 815 

Ser Arg Glu Gly He Ser Gly Glu Leu Lys Asp He His Val Asp His 
820 825 830 



- 13 - 

Lys Gin Asa Leu Tyr Ala Leu Thr His Glu Gly Glu Val Phe His Gin 
835 840 845 

Pro Arg Glu Ala Trp Gin Asn Gly Ala Glu Ser Ser Ser Trp His Lys 



860 

865 !" LyS LeU ^ S Ser I-- Met Ser 

O/U QIC 

b75 880 



His Glu His Lys Pro lie Ala Thr Phe Glu Asp Gly Ser Gin His Gin 
885 890 895 

Leu Lys Ala Gly Gly Trp H is Ala Tyr Ala Ala Pro Glu Arg Gly Pro 

Leu Ala Val Gly Thr Ser Gly Ser Gin Thr Val Phe Asn Arg Leu Met 
915 920 925 

Gin Gly Val Lys Gly Lys Val He Pro Gly Ser Gly Leu Thr Val Lys 

935 940 

Leu Ser Ala Gin Thr Gly Gly Met Thr Gly Ala Glu Gly Arg Lys Val 

950 955 gee 

Ser Ser Lys Phe Ser Glu Arg He Arg Ala Tyr Ala Phe Asn Pro Thr 

970 975 

Met Ser Thr Pro Axg Pro He Lys Asn Ala Ala Tyr Ala Thr Gin His 
980 9 85 99Q 

Gly Trp Gin Gly Axg Glu Gly Leu Lys Pro Leu Tyr Glu Met Gin Gly 
"5 1000 1005 

Ala Leu lie Lys Gin Leu Asp Ala His Asn Val Arg His Asn Ala Pro 
1010 1015 1020 

Gin Pro Asp Leu Gin Ser Lys Leu Glu Thr Leu Asp Leu Gly Glu His 

1030 1035 1040 

Gly Ala Glu Leu Leu Asn Asp Met Lys Arg Phe Arg Asp Glu Leu Glu 

1045 "50 105 5 

Gin Ser Ala Thr Arg Ser Val Thr Val Leu Gly Gin His Gin Gly Val 

10 6 0 1 A(TC 

lUbb 1070 

Leu Lys Ser Asn Gly Glu lie Asn Ser Glu Phe Lys Pro Ser Pro Gly 

1075 1080 



1085 



^ ^O 1 ^ ASn Arg Ser ^ Gln Asp Leu 

.Luyu 1095 



1100 



Ser Lys Ser Leu Gln Gln Ala Val His Ala Thr Pro Pro Ser Ala Glu 

1110 1H5 nao 

Ser Lys Leu Gln Ser Met Leu Gly His Phe Val Ser Ala Gly Val Asp 
1125 H30 1135 

Met Ser His Gln Lys Gly Glu lie Pro Leu Gly Arg Gln Arg Asp Pro 

1140 iidc 

1145 1150 

Asn Asp Lys Thr Ala Leu Thr Lys Ser Arg Leu He Leu Asp Thr Val 
1155 H60 lies 
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Thr lie Gly Glu Leu His Glu Leu Ala Asp Lys Ala Lys Leu Val Ser 
1170 "75 H80 

Asp His Lys Pro Asp Ala Asp Gin He Lys Gin Leu Arg Gin Gin Phe 
1185 119° 1195 1200 

Asp Thr Leu Arg Glu Lys Arg Tyr Glu Ser Asn Pro Val Lys His Tyr 
1205 1210 "IS 

Thr Asp Met Gly Phe Thr His Asn Lys Ala Leu Glu Ala Asn Tyr Asp 
1220 1225 1230 



Ala Val Lys Ala Phe He Asn Ala Phe Lys Lys Glu His His Gly Val 
15 1235 1240 1245 

Asn Leu Thr Thr Arg Thr Val Leu Glu Ser Gin Gly Ser Ala Glu Leu 
1250 1255 1260 



r 



Ala Lys Lys Leu Lys Asn Thr Leu Leu Ser Leu Asp Ser Gly Glu Se 
1265 1270 1275 1280 

Met Ser Phe Ser Arg Ser Tyr Gly Gly Gly Val Ser Thr Val Phe Val 
1285 1290 

Pro Thr Leu Ser Lys Lys Val Pro Val Pro Val lie Pro Gly Ala Gly 
1300 I 305 1310 



He Thr Leu Asp Arg Ala Tyr Asn Leu Ser Phe Ser Arg Thr Ser Gly 
30 1315 1320 1325 

Gly Leu Asn Val Ser Phe Gly Arg Asp Gly Gly Val Ser Gly Asn He 
1330 1335 1340 

35 Met Val Ala Thr Gly His Asp Val Met Pro Tyr Met Thr Gly Lys Lys 

1345 1350 1355 "GO 

Thr Ser Ala Gly Asn Ala Ser Asp Trp Leu Ser Ala Lys His Lys He 
1365 1370 1375 



Ser Pro Asp Leu Arg He Gly Ala Ala Val Ser Gly Thr Leu Gin Gly 
1380 1385 1390 



Thr Leu Gin Asn Ser Leu Lys Phe Lys Leu Thr Glu Asp Glu Leu Pro 
45 1395 1400 1405 

Glv Phe He His Gly Leu Thr His Gly Thr Leu Thr Pro Ala Glu Leu 
, Y 1410 1415 1420 

50 Leu Gin Lys Gly He Glu His Gin Met Lys Gin Gly Ser Lys Leu Thr 

1425 1430 1435 "40 



Phe Ser Val Asp Thr Ser Ala Asn Leu Asp Leu Arg Ala Gly He Asn 
1445 1450 1455 

Leu Asn Glu Asp Gly Ser Lys Pro Asn Gly Val Thr Ala Arg Val Ser 
1460 1465 1470 



Ala Gly Leu Ser Ala Ser Ala Asn Leu Ala Ala Gly Ser Arg Glu Arg 
60 1475 1480 1485 




Ser Thr Thr Ser Gly Gin Phe Gly Ser Thr Thr Ser Ala Ser Asn Asn 
1490 1495 1500 

Arq Pro Thr Phe Leu Asn Gly Val Gly Ala Gly Ala Asn Leu Thr Ala 
15 05 1510 1515 1520 

Ala Leu Gly Val Ala His Ser Ser Thr His Glu Gly Lys Pro Val Gly 
15 25 1530 1535 

He Phe Pro Ala Phe Thr Ser Thr Asn Val Ser Ala Ala Leu Ala Leu 
1540 1545 1550 

Asp Asn Arg Thr Ser Gin Ser He Ser Leu Glu Leu Lys Arg Ala Glu 
1555 1560 1565 

Pro Val Thr Ser Asn Asp He Ser Glu Leu Thr Ser Thr Leu Gly Lys 
1570 1575 1580 

His Phe Lys Asp Ser Ala Thr Thr Lys Met Leu Ala Ala Leu Lys Glu 
1585 1590 1595 1600 

Leu Asp Asp Ala Lys Pro 



1585 1590 1595 

Ala Glu Gin Leu His He Leu Gin Gin His 



1605 



1610 1615 



Phe Ser Ala Lys Asp Val Val Gly Asp Glu Arg Tyr Glu Ala Val Arg 
1620 1625 1630 

Asn Leu Lys Lys Leu Val He Arg Gin Gin Ala Ala Asp Ser His Ser 
1635 164 0 1645 

Met Glu Leu Gly Ser Ala Ser His Ser Thr Thr Tyr Asn Asn Leu Ser 
1650 1655 1660 

Arg He Asn Asn Asp Gly He Val Glu Leu Leu His Lys His Phe Asp 
1665 1670 1675 1680 

Ala Ala Leu Pro Ala Ser Ser Ala Lys Arg Leu Gly Glu Met Met Asn 
1685 1690 1695 

Asn Asp Pro Ala Leu Lys Asp He He Lys Gin Leu Gin Ser Thr Pro 
1700 1705 1710 

Phe Ser Ser Ala Ser Val Ser Met Glu Leu Lys Asp Gly Leu Arg Glu 
1715 1720 1725 

Gin Thr Glu Lys Ala He Leu Asp Gly Lys Val Gly Arg Glu Glu Val 
1730 1735 1740 

Glv Val Leu Phe Gin Asp Arg Asn Asn Leu Arg Val Lys Ser Val Ser 
17 4 5 1750 1755 1760 

Val Ser Gin Ser Val Ser Lys Ser Glu Gly Phe Asn Thr Pro Ala Leu 
1765 1770 1775 

Leu Leu Gly Thr Ser Asn Ser Ala Ala Met Ser Met Glu Arg Asn He 
1780 1785 1790 

Glv Thr He Asn Phe Lys Tyr Gly Gin Asp Gin Asn Thr Pro Arg Arg 
1795 1800 1805 

Phe Thr Leu Glu Gly Gly He Ala Gin Ala Asn Pro Gin Val Ala Ser 
1810 1815 1820 
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Ala Leu Thr Asp Leu Lys Lys Glu Gly Leu Glu Met Lys Ser 
1825 1830 1835 

This protein or polypeptide is about 198 kDa and has a pi of 8.98. 

The present invention relates to an isolated DNA molecule having a nucleotide 
sequence of SEQ. ID. No. 3 as follows: 

ATGACATCGT CACAGCAGCG GGTTGAAAGG TTTTTACAGT ATTTCTCCGC CGGGTGTAAA 6 0 

ACGCCCATAC ATCTGAAAGA CGGGGTGTGC GCCCTGTATA ACGAACAAGA TGAGGAGGCG 12 0 

GCGGTGCTGG AAGTACCGCA ACACAGCGAC AG C CTGTTAC TACACTGCCG AATCATTGAG 180 

GCTGACCCAC AAACTTCAAT AACCCTGTAT TCGATGCTAT TACAGCTGAA TTTTGAAATG 24 0 

GCGGCCATGC GCGGCTGTTG GCTGGCGCTG GATGAACTGC ACAACGTGCG TTTATGTTTT 3 00 

CAGCAGTCGC TGG AG CAT CT GGATGAAGCA AGTTTTAGCG ATATCGTTAG CGGCTTCATC 3 60 

GAACATGCGG CAGAAGTGCG TG AG TAT ATA G CG CAAT TAG ACG AG AG TAG CGCGGCATAA 420 

This is known as the dspF gene. This isolated DNA molecule of the present invention 
encodes a hypersensitive response elicitor protein or polypeptide having an amino 
acid sequence of SEQ. ID. No. 4 as follows: 

Met Thr Ser Ser Gin Gin Arg Val Glu Arg Phe Leu Gin Tyr Phe Ser 
15 10 15 

Ala Gly Cys Lys Thr Pro lie His Leu Lys Asp Gly Val Cys Ala Leu 
20 25 30 

Tyr Asn Glu Gin Asp Glu Glu Ala Ala Val Leu Glu Val Pro Gin His 
35 40 45 

Ser Asp Ser Leu Leu Leu His Cys Arg lie lie Glu Ala Asp Pro Gin 
50 55 60 

Thr Ser lie Thr Leu Tyr Ser Met Leu Leu Gin Leu Asn Phe Glu Met 
65 70 75 80 

Ala Ala Met Arg Gly Cys Trp Leu Ala Leu Asp Glu Leu His Asn Val 
85 90 95 

Arg Leu Cys Phe Gin Gin Ser Leu Glu His Leu Asp Glu Ala Ser Phe 
100 105 110 

Ser Asp lie Val Ser Gly Phe lie Glu His Ala Ala Glu Val Arg Glu 
115 120 125 

Tyr lie Ala Gin Leu Asp Glu Ser Ser Ala Ala 
130 135 
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This protein or polypeptide is about 16 kDa and has a pi of 4.45. 

Fragments of the above hypersensitive response elicitor polypeptide or 
protein are encompassed by the present invention. 
5 Suitable fragments can be produced by several means. In the first, 

subclones of the gene encoding the elicitor protein of the present invention are 
produced by conventional molecular genetic manipulation by subcloning gene 
fragments. The subclones then are expressed in vitro or in vivo in bacterial cells to 
yield a smaller protein or peptide that can be tested for elicitor activity according to 

10 the procedure described below. 

As an alternative, fragments of an elicitor protein can be produced by 
digestion of a full-length elicitor protein with proteolytic enzymes like chymotrypsin 
or Staphylococcus proteinase A, or trypsin. Different proteolytic enzymes are likely 
to cleave elicitor proteins at different sites based on the amino acid sequence of the 

1 5 elicitor protein. Some of the fragments that result from proteolysis may be active 

elicitors of resistance. 

In another approach, based on knowledge of the primary structure of 
the protein, fragments of the elicitor protein gene may be synthesized by using the 
PGR technique together with specific sets of primers chosen to represent particular 
20 portions of the protein. These then would be cloned into an appropriate vector for 
increased expression of a truncated peptide or protein. 

Chemical synthesis can also be used to make suitable fragments. Such 
a synthesis is carried out using known amino acid sequences for the elicitor being 
produced. Alternatively, subjecting a full length elicitor to high temperatures and 
25 pressures will produce fragments. These fragments can then be separated by 
conventional procedures (e.g., chromatography, SDS-PAGE). 

Variants may also (or alternatively) be modified by, for example, the 
deletion or addition of amino acids that have minimal influence on the properties, 
secondary structure and hydropathic nature of the polypeptide. For example, a 
30 polypeptide may be conjugated to a signal (or leader) sequence at the N-terminal end 
of the protein which co-translationally or post-translationally directs transfer of the 
protein. The polypeptide may also be conjugated to a linker or other sequence for 
ease of synthesis, purification, or identification of the polypeptide. 
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Suitable DNA molecules are those that hybridize to a DNA molecule 
comprising a nucleotide sequence of SEQ. ID. Nos. 1 and 3, under stringent 
conditions. An example of suitable high stringency conditions is when hybridization 
is carried out at 65°C for 20 hours in a medium containing 1M NaCl, 50 mM Tris- 
5 HC1, pH 7.4, 10 mM EDTA, 0.1% sodium dodecyl sulfate, 0.2% ficoll, 0.2% 
polyvinylpyrrolidone, 0.2% bovine serum albumin, 50 [im g/ml E. coli DNA. 
However, any DNA molecules hybridizing to a DNA molecule comprising the 
nucleotide sequences of SEQ. ID. Nos. 1 and 3, under such stringent conditions must 
not be identical to the nucleic acids encoding the hypersensitive response elicitor 

10 proteins or polypeptides of E. amylovora (as disclosed by Wei, Z.-M., et al, "Harpin, 
Elicitor of the Hypersensitive Response Produced by the Plant Pathogen Erwinia 
amylovora"" Science 257:85-88 (1992), which is hereby incorporated by reference), 
Erwinia chrysanthemi (as disclosed by Bauer, et. al., "Erwinia chrysanthemi 
Harpin Ech : Soft-Rot Pathogenesis," MPMI 8(4): 484-91 (1995), which is hereby 

15 incorporated by reference), Erwinia carotovora (as disclosed by Cui, et. al., "The 

RsmA" Mutants of Erwinia carotovora subsp. carotovora Strain Ecc71 Overexpress 
hrpN Ecc and Elicit a Hypersensitive Reaction-like Response in Tobacco Leaves," 
MPMI 9(7): 565-73 (1966), which is hereby incorporated by reference), Erwinia 
stewartii (as disclosed by Ahmad, et. al., "Harpin is not Necessary for the 

20 Pathogenicity of Erwinia stewartii on Maize," 8th Int'l. Cong. Molec. Plant-Microb. 
Inter. July 14-19, 1996 and Ahmad, et. al., "Harpin is not Necessary for the 
Pathogenicity of Erwinia stewartii on Maize," Ann. Mtg. Am. Phytopath. Soc. July 
27-3 1, 1996), which are hereby incorporated by reference), and Pseudomonas 
syringae pv. syringae (WO 94/26782 to Cornell Research Foundation, Inc., which is 

25 hereby incorporated by reference). 

The protein or polypeptide of the present invention is preferably 
produced in purified form (preferably at least about 80%, more preferably 90%, pure) 
by conventional techniques. Typically, the protein or polypeptide of the present 
invention is secreted into the growth medium of recombinant host cells. 

30 Alternatively, the protein or polypeptide of the present invention is produced but not 
secreted into growth medium. In such cases, to isolate the protein, the host cell (e.g., 
E. coli) carrying a recombinant plasmid is propagated, lysed by soni cation, heat, or 




chemical treatment, and the homogenate is centrifuged to remove bacterial debris. 
The supernatant is then subjected to sequential ammonium sulfate precipitation. The 
fraction containing the polypeptide or protein of the present invention is subjected to 
gel filtration in an appropriately sized dextran or polyacrylamide column to separate 
5 the proteins. If necessary, the protein fraction may be further purified by HPLC. 

The DNA molecule encoding the hypersensitive response elicitor 
polypeptide or protein can be incorporated in cells using conventional recombinant 
DNA technology. Generally, this involves inserting the DNA molecule into an 
expression system to which the DNA molecule is heterologous (i.e. not normally 
10 present). The heterologous DNA molecule is inserted into the expression system or 
vector in proper sense orientation and correct reading frame. The vector contains the 
necessary elements for the transcription and translation of the inserted protein-coding 
sequences. 

U.S. Patent No. 4,237,224 to Cohen and Boyer, which is hereby 

1 5 incorporated by reference, describes the production of expression systems in the form 
of recombinant plasmids using restriction enzyme cleavage and ligation with DNA 
ligase. These recombinant plasmids are then introduced by means of transformation 
and replicated in unicellular cultures including procaryotic organisms and eucaryotic 
cells grown in tissue culture. 

20 Recombinant genes may also be introduced into viruses, such as 

vaccina virus. Recombinant viruses can be generated by transfection of plasmids into 
cells infected with virus. 

Suitable vectors include, but are not limited to, the following viral 
vectors such as lambda vector system gtl 1, gt WES.tB, Charon 4, and plasmid vectors 

25 such as pBR322, pBR325, pACYC177, pACYC1084, pUC8, pUC9, pUC18, pUC19, 
pLG339, pR290, pKC37, pKClOl, SV 40, pBluescript II SK +/- or KS +/- (see 
"Stratagene Cloning Systems" Catalog (1993) from Stratagene, La Jolla, Calif, which 
is hereby incorporated by reference), pQE, pIH821, pGEX, pET series (see F.W. 
Studier et. al., "Use of T7 RNA Polymerase to Direct Expression of Cloned Genes," 

30 Gene Expression Technology vol. 185 (1990), which is hereby incorporated by 

reference), and any derivatives thereof. Recombinant molecules can be introduced 
into cells via transformation, particularly transduction, conjugation, mobilization, or 



electroporation. The DNA sequences are cloned into the vector using standard 
cloning procedures in the art, as described by Sambrook et al., Molecular Cloning: A 
Laboratory Manual , Cold Springs Laboratory, Cold Springs Harbor, New York 
(1989), which is hereby incorporated by reference. 
5 A variety of host-vector systems may be utilized to express the protein- 

encoding sequence(s). Primarily, the vector system must be compatible with the host 
cell used. Host-vector systems include but are not limited to the following: bacteria 
transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA; 
microorganisms such as yeast containing yeast vectors; mammalian cell systems 
10 infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected 
with virus (e.g., baculovirus); and plant cells infected by bacteria. The expression 
elements of these vectors vary in their strength and specificities. Depending upon the 
host-vector system utilized, any one of a number of suitable transcription and 
translation elements can be used. 
15 Different genetic signals and processing events control many levels of 

gene expression (e.g., DNA transcription and messenger RNA (mRNA) translation). 

Transcription of DNA is dependent upon the presence of a promotor 
which is a DNA sequence that directs the binding of RNA polymerase and thereby 
promotes mRNA synthesis. The DNA sequences of eucaryotic promotors differ from 
20 those of procaryotic promotors. Furthermore, eucaryotic promotors and 

accompanying genetic signals may not be recognized in or may not function in a 
procaryotic system, and, further, procaryotic promotors are not recognized and do not 

function in eucaryotic cells. 

Similarly, translation of mRNA in procaryotes depends upon 

25 the presence of the proper procaryotic signals which differ from those of eucaryotes. 
Efficient translation of mRNA in procaryotes requires a ribosome binding site called 
the Shine-Dalgarno ("SD") sequence on the mRNA. This sequence is a short 
nucleotide sequence of mRNA that is located before the start codon, usually AUG, 
which encodes the amino-terminal methionine of the protein. The SD sequences are 

30 complementary to the 3 '-end of the 1 6S rRNA (ribosomal RNA) and probably 

promote binding of mRNA to ribosomes by duplexing with the rRNA to allow correct 
positioning of the ribosome. For a review on maximizing gene expression, see 
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Roberts and Lauer, Methods in Enzymology . 68:473 (1979), which is hereby 
incorporated by reference. 

Promotors vary in their "strength" (i.e. their ability to promote 
transcription). For the purposes of expressing a cloned gene, it is desirable to use 
5 strong promotors in order to obtain a high level of transcription and, hence, 

expression of the gene. Depending upon the host cell system utilized, any one of a 
number of suitable promotors may be used. For instance, when cloning in E. coli, its 
bacteriophages, or plasmids, promotors such as the T7 phage promoter, lac promotor, 
trp promotor, recA promotor, ribosomal RNA promotor, the Pr and Pl promotors of 

10 coliphage lambda and others, including but not limited, to lacUVS, ompF, bla, Ipp, 
and the like, may be used to direct high levels of transcription of adjacent DNA 
segments. Additionally, a hybrid trp-lacUY5 {tac) promotor or other E. coli 
promotors produced by recombinant DNA or other synthetic DNA techniques may be 
used to provide for transcription of the inserted gene. 

15 Bacterial host cell strains and expression vectors may be chosen which 

inhibit the action of the promotor unless specifically induced. In certain operations, 
the addition of specific inducers is necessary for efficient transcription of the inserted 
DNA. For example, the lac operon is induced by the addition of lactose or IPTG 
(isopropylthio-beta-D-galactoside). A variety of other operons, such as trp, pro, etc., 

20 are under different controls. 

Specific initiation signals are also required for efficient gene 
transcription and translation in procaryotic cells. These transcription and translation 
initiation signals may vary in "strength" as measured by the quantity of gene specific 
messenger RNA and protein synthesized, respectively. The DNA expression vector, 

25 which contains a promotor, may also contain any combination of various "strong" 

transcription and/or translation initiation signals. For instance, efficient translation in 
E, coli requires an SD sequence about 7-9 bases 5' to the initiation codon ("ATG") to 
provide a ribosome binding site. Thus, any SD-ATG combination that can be utilized 
by host cell ribosomes may be employed. Such combinations include but are not 

30 limited to the SD-ATG combination from the cro gene or the N gene of coliphage 

lambda, or from the E. coli tryptophan E, D, C, B or A genes. Additionally, any SD- 




ATG combination produced by recombinant DNA or other techniques involving 
incorporation of synthetic nucleotides may be used. 

Once the isolated DNA molecule encoding the hypersensitive response 
elicitor polypeptide or protein has been cloned into an expression system, it is ready 
5 to be incorporated into a host cell. Such incorporation can be carried out by the 
various forms of transformation noted above, depending upon the vector/host cell 
system. Suitable host cells include, but are not limited to, bacteria, virus, yeast, 
mammalian cells, insect, plant, and the like. 

The present invention further relates to methods of imparting disease 

10 resistance to plants, enhancing plant growth, and/or effecting insect control for plants. 
These methods involve applying a hypersensitive response elicitor polypeptide or 
protein in a non-infectious form to all or part of a plant or a plant seed under 
conditions where the polypeptide or protein contacts all or part of the cells of the plant 
or plant seed. Alternatively, the hypersensitive response elicitor protein or 

15 polypeptide can be applied to plants such that seeds recovered from such plants 

themselves are able to impart disease resistance in plants, to enhance plant growth, 
and/or to effect insect control. 

As an alternative to applying a hypersensitive response elicitor 
polypeptide or protein to plants or plant seeds in order to impart disease resistance in 

20 plants, to effect plant growth, and/or to control insects on the plants or plants grown 
from the seeds, transgenic plants or plant seeds can be utilized. When utilizing 
transgenic plants, this involves providing a transgenic plant transformed with a DNA 
molecule encoding a hypersensitive response elicitor polypeptide or protein and 
growing the plant under conditions effective to permit that DNA molecule to impart 

25 disease resistance to plants, to enhance plant growth, and/or to control insects. 

Alternatively, a transgenic plant seed transformed with a DNA molecule encoding a 
hypersensitive response elicitor polypeptide or protein can be provided and planted in 
soil. A plant is then propagated from the planted seed under conditions effective to 
permit that DNA molecule to impart disease resistance to plants, to enhance plant 

30 growth, and/or to control insects. 

The embodiment of the present invention where the hypersensitive 
response elicitor polypeptide or protein is applied to the plant or plant seed can be 
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carried out in a number of ways, including: 1 ) application of an isolated elicitor 
polypeptide or protein; 2) application of bacteria which do not cause disease and are 
transformed with genes encoding a hypersensitive response elicitor polypeptide or 
protein; and 3) application of bacteria which cause disease in some plant species (but 
not in those to which they are applied) and naturally contain a gene encoding the 
hypersensitive response elicitor polypeptide or protein. 

In one embodiment of the present invention, the hypersensitive 
response elicitor polypeptide or protein of the present invention can be isolated from 
Erwinia amylovora as described in the Examples infra. Preferably, however, the 
isolated hypersensitive response elicitor polypeptide or protein of the present 
invention is produced recombinantly and purified as described supra. 

In other embodiments of the present invention, the hypersensitive 
response elicitor polypeptide or protein of the present invention can be applied to 
plants or plant seeds by applying bacteria containing genes encoding the 
hypersensitive response elicitor polypeptide or protein. Such bacteria must be capable 
of secreting or exporting the polypeptide or protein so that the elicitor can contact 
plant or plant seed cells. In these embodiments, the hypersensitive response elicitor 
polypeptide or protein is produced by the bacteria inplanta or on seeds or just prior to 
introduction of the bacteria to the plants or plant seeds. 

In one embodiment of the bacterial application mode of the present 
invention, the bacteria do not cause the disease and have been transformed (e.g., 
recombinantly) with genes encoding a hypersensitive response elicitor polypeptide or 
protein. For example, E. coli, which does not elicit a hypersensitive response in 
plants, can be transformed with genes encoding a hypersensitive response elicitor 
polypeptide or protein and then applied to plants. Bacterial species other than E. coli 
can also be used in this embodiment of the present invention. 

In another embodiment of the bacterial application mode of the present 
invention, the bacteria do cause disease and naturally contain a gene encoding a 
hypersensitive response elicitor polypeptide or protein. Examples of such bacteria are 
noted above. However, in this embodiment, these bacteria are applied to plants or 
their seeds which are not susceptible to the disease carried by the bacteria. For 
example, Erwinig amylovora causes disease in apple or pear but not in tomato. 
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However, such bacteria will elicit a hypersensitive response in tomato. Accordingly 
m accordance with this embodiment of the present invention, Ervinia amylovora can 
be apphed to tomato plants or seeds to enhance growth without causing disease in that 



species 
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The method of the present invention can be utilized to treat a wide 
variety of plants or their seeds to impart disease resistance, enhance growth and/or 
control insects. Suitable plants include dicots and monocots. More particularly 
useful crop plants can include: alfalfa, rice, wheat, barley, rye, cotton, sunflower 
peanut, corn, potato, sweet potato, bean, pea, chicory, lettuce, endive, cabbage 
brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, turnip, radish, spinach 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco' 
tomato, sorghum, and sugarcane. Examples of suitable ornamental plants are- 
Arabidopsis thaliana, Saintpaulia, petunia, pelargonium, poinsettia, chrysanthemum, 
15 carnation, and zinnia. 

With regard to the use of the hypersensitive response elicitor protein or 
polypeptide of the present invention in imparting disease resistance, absolute 
immunity against infection may not be conferred, but the severity of the disease is 
reduced and symptom development is delayed. Lesion number, lesion size and 
extent of spoliation of fungal pathogens are all decreased. This method of imparting 
disease resistance has the potential for treating previously undatable diseases 
treating diseases systemically which might not be treated separately due to cost and 
avoiding the use of infectious agents or environmentally harmful materials. 

The method of imparting pathogen resistance to plants in accordance 
with the present invention is useful in imparting resistance to a wide variety of 
pathogens including viruses, bacteria, and fungi. Resistance, inter alia, to the 
following viruses can be achieved by the method of the present invention: Tobacco 
mosaic virus and Tomato mosaic virus. Resistance, inter alia, to the following 
bacteria can also be imparted to plants in accordance with present invention: 
Pseudomonas solancearum, Pseudomonas syringae pv. tabaci, and Xanthamonas 
campestris pv. pelargonii. Plants can be made resistant, inter alia, to the following 




fungi by use of the method of the present invention: Fusarium oxysporum and 
Phytophthora infestans. 

With regard to the use of the hypersensitive response elicitor protein or 
polypeptide of the present invention to enhance plant growth, various forms of plant 
5 growth enhancement or promotion can be achieved. This can occur as early as when 
plant growth begins from seeds or later in the life of a plant. For example, plant 
growth according to the present invention encompasses greater yield, increased 
quantity of seeds produced, increased percentage of seeds germinated, increased plant 
size, greater biomass, more and bigger fruit, earlier fruit coloration, and earlier fruit 

10 and plant maturation. As a result, the present invention provides significant economic 
benefit to growers. For example, early germination and early maturation permit crops 
to be grown in areas where short growing seasons would otherwise preclude their 
growth in that locale. Increased percentage of seed germination results in improved 
crop stands and more efficient seed use. Greater yield, increased size, and enhanced 

15 biomass production allow greater revenue generation from a given plot of land. 

Another aspect of the present invention is directed to effecting any 
form of insect control for plants. For example, insect control according to the present 
invention encompasses preventing insects from contacting plants to which the 
hypersensitive response elicitor has been applied, preventing direct insect damage to 

20 plants by feeding injury, causing insects to depart from such plants, killing insects 
proximate to such plants, interfering with insect larval feeding on such plants, 
preventing insects from colonizing host plants, preventing colonizing insects from 
releasing phytotoxins, etc. The present invention also prevents subsequent disease 
damage to plants resulting from insect infection. 

25 The present invention is effective against a wide variety of insects. 

European corn borer is a major pest of com (dent and sweet com) but also feeds on 
over 200 plant species including green, wax, and lima beans and edible soybeans, 
peppers, potato, and tomato plus many weed species. Additional insect larval feeding 
pests which damage a wide variety of vegetable crops include the following: beet 

30 armyworm, cabbage looper, com ear worm, fall army worm, diamondback moth, 

cabbage root maggot, onion maggot, seed corn maggot, pickleworm (melonworm), 
pepper maggot, and tomato pinworm. Collectively, this group of insect pests 
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represents the most economically important group of pests for vegetable production 
worldwide. 

The method of the present invention involving application of the 
hypersensitive response elicitor polypeptide or protein can be carried out through a 
variety of procedures when all or part of the plant is treated, including leaves, stems 
roots, propagules (e.g., cuttings), etc. This may (but need not) involve infiltration of 
the hypersensitive response elicitor polypeptide or protein into the plant. Suitable 
application methods include high or low pressure spraying, injection, and leaf 
abrasion proximate to when elicitor application takes place. When treating plant 
seeds, in accordance with the application embodiment of the present invention, the 
hypersensitive response elicitor protein or polypeptide can be applied by low or high 
pressure spraying, coating, immersion, or injection. Other suitable application 
procedures can be envisioned by those skilled in the art provided they are able to 
effect contact of the hypersensitive response elicitor polypeptide or protein with cells 
of the plant or plant seed. Once treated with the hypersensitive response elicitor of 
the present invention, the seeds can be planted in natural or artificial soil and 
cultivated using conventional procedures to produce plants. After plants have been 
propagated from seeds treated in accordance with the present invention, the plants 
may be treated with one or more applications of the hypersensitive response elicitor 
protein or polypeptide to impart disease resistance to plants, to enhance plant growth, 
and/or to control insects on the plants. 

The hypersensitive response elicitor polypeptide or protein can be 
applied to plants or plant seeds in accordance with the present invention alone or in a 
mixture with other materials. Alternatively, the hypersensitive response elicitor 
polypeptide or protein can be applied separately to plants with other materials being 
applied at different times. 

A composition suitable for treating plants or plant seeds in accordance 
with the application embodiment of the present invention contains a hypersensitive 
response elicitor polypeptide or protein in a carrier. Suitable carriers include water, 
aqueous solutions, slurries, or dry powders. In this embodiment, the composition ' 
contains greater than 500 nM hypersensitive response elicitor polypeptide or protein. 
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Although not required, this composition may contain additional 
additives including fertilizer, insecticide, fungicide, nematacide, and mixtures thereof. 
Suitable fertilizers include (NH 4 ) 2 N0 3 . An example of a suitable insecticide is 
Malathion. Useful fungicides include Captan. 

Other suitable additives include buffering agents, wetting agents, 
coating agents, and abrading agents. These materials can be used to facilitate the 
process of the present invention. In addition, the hypersensitive response elicitor 
polypeptide or protein can be applied to plant seeds with other conventional seed 
formulation and treatment materials, including clays and polysaccharides. 

In the alternative embodiment of the present invention involving the 
use of transgenic plants and transgenic seeds, a hypersensitive response elicitor 
polypeptide or protein need not be applied topically to the plants or seeds. Instead, 
transgenic plants transformed with a DNA molecule encoding a hypersensitive 
response elicitor polypeptide or protein are produced according to procedures well 

1 5 known in the art 

The vector described above can be microinjected directly into plant 

cells by use of micropipettes to transfer mechanically the recombinant DNA. 

Crossway, MoLGen^enetics, 202:179-85 (1985), which is hereby incorporated by 

reference The genetic material may also be transferred into the plant cell using 

20 polyethylene glycol. Krens, et al., Nature, 296:72-74 (1982), which is hereby 

incorporated by reference. 

Another approach to transforming plant cells with a gene which 
imparts resistance to pathogens is particle bombardment (also known as biolistic 
transformation) of the host cell. This can be accomplished in one of several ways. 
25 The first involves propelling inert or biologically active particles at ceUs. This 

technique is disclosed in U.S. Patent Nos. 4,945,050, 5,036,006, and 5,100,792, all to 
Sanford et al., which are hereby incorporated by reference. Generally, this procedure 
involves propelling inert or biologically active particles at the cells under conditions 
effective to penetrate the outer surface of the cell and to be incorporated within the 
30 interior thereof. When inert particles are utilized, the vector can be introduced into 
the cell by coating the particles with the vector containing the heterologous DNA. 
Alternatively, the target cell can be surrounded by the vector so that the vector is 
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carried into the cell by the wake of the particle. Biologically active particles (e.g., 
dried bacterial cells containing the vector and heterologous DNA) can also be 

propelled into plant cells. 

Yet another method of introduction is fusion of protoplasts with other 
5 entities, either minicells, cells, lysosomes or other fusible lipid-surfaced bodies. 
Fraley, et »i Vmc. Natl. Acad. Sci. USA . 79:1859-63 (1982), which is hereby 

incorporated by reference. 

The DNA molecule may also be introduced into the plant cells by 
electroporation. Fromm et al., Prnr Natl, Acad. Sci. USA, 82:5824 (1985), which is 
1 0 hereby incorporated by reference. In this technique, plant protoplasts are 

electroporated in the presence of plasmids containing the expression cassette. 
Electrical impulses of high field strength reversibly permeabilize biomembranes 
allowing the introduction of the plasmids. Electroporated plant protoplasts reform the 

cell wall, divide, and regenerate. 

Another method of introducing the DNA molecule into plant cells is to 
infect a plant cell with Agrobacterium tumefaciens or A. rhizogenes previously 
transformed with the gene. Under appropriate conditions known in the art, the 
transformed plant cells are grown to form shoots or roots, and develop further into 
plants. Generally, this procedure involves inoculating the plant tissue with a 
suspension of bacteria and incubating the tissue for 48 to 72 hours on regeneration 
medium without antibiotics at 25-28°C. 

Agrobacterium is a representative genus of the gram-negative family 
Rhizobiaceae. Its species are responsible for crown gall {A. tumefaciens) and hairy 
root disease (A rhizogenes). The plant cells in crown gall tumors and hairy roots are 
25 induced to produce amino acid derivatives known as opines, which are catabolized 
only by the bacteria. The bacterial genes responsible for expression of opines are a 
convenient source of control elements for chimeric expression cassettes. In addition, 
assaying for the presence of opines can be used to identify transformed tissue. 

Heterologous genetic sequences can be introduced into appropriate 
30 plant cells, by means of the Ti plasmid of A. tumefaciens or the Ri plasmid of A. 
rhizogenes. The Ti or Ri plasmid is transmitted to plant cells on infection by 
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Agrobacterium and is stably integrated into the plant genome. J. Schell, Science . 
237: 1 1 76-83 (1 987), which is hereby incorporated by reference. 

After transformation, the transformed plant cells must be regenerated. 

Plant regeneration from cultured protoplasts is described in Evans et 
al - H^dbook of Plant Cell Cultures Vol 1 : (MacMillan Publishing Co., New York, 
1983); and Vasil I.R. (ed.), Cell Culture and Somatic Cell n^^n, ^ p hnf „ Acad ' 
Press, Orlando, Vol. I, 1984, and Vol. Ill (1986), which are hereby incorporated by 
reference. 

It is known that practically all plants can be regenerated from cultured 
cells or tissues, including but not limited to, all major species of sugarcane, sugar 
beets, cotton, fruit trees, and legumes. 

Means for regeneration vary from species to species of plants, but 
generally a suspension of transformed protoplasts or a petri plate containing 
transformed explants is first provided. Callus tissue is formed and shoots may be 
induced from callus and subsequently rooted. Alternatively, embryo formation can be 
induced in the callus tissue. These embryos germinate as natural embryos to form 
plants. The culture media will generally contain various amino acids and hormones, 
such as auxin and cytokinins. It is also advantageous to add glutamic acid and proline 
to the medium, especially for such species as corn and alfalfa. Efficient regeneration 
will depend on the medium, on the genotype, and on the history of the culture. If 
these three variables are controlled, then regeneration is usually reproducible and 
repeatable. 

After the expression cassette is stably incorporated in transgenic plants, 
it can be transferred to other plants by sexual crossing. Any of a number of standard ' 
breeding techniques can be used, depending upon the species to be crossed. 

Once transgenic plants of this type are produced, the plants themselves 
can be cultivated in accordance with conventional procedure with the presence of the 
gene encoding the hypersensitive response elicitor resulting in disease resistance, 
enhanced plant growth, and/or control of insects on the plant. Alternatively, 
transgenic seeds are recovered from the transgenic plants. These seeds can then be 
planted in the soil and cultivated using conventional procedures to produce transgenic 
plants. The transgenic plants are propagated from the planted transgenic seeds under 
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conditions effective to impart disease resistance to plants, to enhance plant growth, 
and/or to control insects. While not wishing to be bound by theory, such disease 
resistance, growth enhancement, and/or insect control may be RNA mediated or may 
result from expression of the elicitor polypeptide or protein. 

When transgenic plants and plant seeds are used in accordance with the 
present invention, they additionally can be treated with the same materials as are used 
to treat the plants and seeds to which a hypersensitive response elicitor polypeptide or 
protein is applied. These other materials, including hypersensitive response elicitors, 
can be applied to the transgenic plants and plant seeds by the above-noted procedures, 
including high or low pressure spraying, injection, coating, and immersion. Similarly, 
after plants have been propagated from the transgenic plant seeds, the plants may be 
treated with one or more applications of the hypersensitive response elicitor to impart 
disease resistance, enhance growth, and/or control insects. Such plants may also be 
treated with conventional plant treatment agents (e.g., insecticides, fertilizers, etc.). 

Another aspect of the present invention is to utilize the subject elicitor 
proteins or polypeptides to design molecules that will inactivate, destroy, or bind to 
these proteins or polypeptides. Since these elicitors are found in plant pathogens, 
particularly Erwinia amylovora, the pathogens themselves can be neutralized by the 
designed molecules so that disease and/or hypersensitive response is prevented or 
altered. Examples of disease preventing molecules are antibodies, such as 
monoclonal or polyclonal antibodies, raised against the elicitor proteins or 
polypeptides of the present invention or binding portions thereof. Other examples of 
disease preventing molecules include antibody fragments, half-antibodies, hybrid 
derivatives, probes, and other molecular constructs. 

Monoclonal antibody production may be effected by techniques which 
are well-known in the art. Basically, the process involves first obtaining immune 
cells (lymphocytes) from the spleen of a mammal (e.g., mouse) which has been 
previously immunized, either in vivo or in vitro, with the antigen of interest (e.g., an 
elicitor protein or polypeptide of the present invention or binding portions thereof). 
The antibody-secreting lymphocytes are then fused with (mouse) myeloma cells or 
transformed cells, which are capable of replicating indefinitely in cell culture, thereby 
producing an immortal, immunoglobulin-secreting cell line. The resulting fused cells, 
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or hybridomas, are cultured, and the resulting colonies screened for the production of 
the desired monoclonal antibodies. Colonies producing such antibodies are cloned, 
and grown either in vivo or in vitro to produce large quantities of antibody. A 
description of the theoretical basis and practical methodology of fusing such cells is 
set forth in Kohler and Milstein, Nature 256:495 (1975), which is hereby incorporated 
by reference. 

Mammalian lymphocytes are immunized by in vivo immunization of 
the animal (e.g., a mouse) with the elicitor proteins or polypeptides of the present 
invention or binding portions thereof. Such immunizations are repeated as necessary 
at intervals of up to several weeks to obtain a sufficient titer of antibodies. Following 
the last antigen boost, the animals are sacrificed and spleen cells removed. 

Fusion with mammalian myeloma cells or other fusion partners 
capable of replicating indefinitely in cell culture is effected by standard and 
well-known techniques, for example, by using polyethylene glycol ("PEG") or other 
fusing agents (See Milstein and Kohler, Eur. J. Immunol . 6:51 1 (1976), which is 
hereby incorporated by reference). This immortal cell line, which is preferably 
murine, but may also be derived from cells of other mammalian species, including but 
not limited to rats, is selected to be deficient in enzymes necessary for the utilization 
of certain nutrients, to be capable of rapid growth, and to have good fusion capability. 
Many such cell lines are known to those skilled in the art, and others are regularly 
described. 

Procedures for raising polyclonal antibodies are also well known. 
Typically, such antibodies can be raised by administering the elicitor proteins or 
polypeptides of the present invention or binding portions thereof subcutaneously to 
New Zealand white rabbits which have first been bled to obtain pre-immune serum. 
The antigens can be injected at a total volume of 100 ul per site at six different sites. 
Each injected material will contain synthetic surfactant adjuvant pluronic polyols, or 
pulverized acrylamide gel containing the protein or polypeptide after SDS- 
polyacrylamide gel electrophoresis. The rabbits are then bled two weeks after the first 
injection and periodically boosted with the same antigen three times every six weeks. 
A sample of serum is then collected 10 days after each boost. Polyclonal antibodies 
are then recovered from the serum by affinity chromatography using the 
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corresponding antigen to capture the antibody. Ultimately, the rabbits are euthemzed 
with pentobarbital 150 m g /Kg IV. This and other procedures for raising polyclonal 
antibodies are disclosed in E. Harlow, et. al., editors, Anm^A^imil 
Manual (1988), which is hereby incorporated by reference. 

In addition to utilizing whole antibodies, the processes of the present 
invention encompass use of binding portions of such antibodies. Such binding 
portions include Fab fragments, F(ab') 2 fragments, and Fv fragments. These antrbody 
fragments can be made by conventional procedures, such as proteolytic fxagmentatron 
procedures, as described in J. Coding, Monoclgr^^^ 
Practice, pp. 98-1 18 (N.Y. Academic Press 1983), which is hereby incorporated by 

reference. 

Alternatively, the processes of the present invention can utihze probes 
or ligands found either in nature or prepared synthetically by recombinant DNA 
procedures or other biological or molecular procedures. SuitaWe probes or hgands 
are molecules which bind to the elicitor proteins or polypeptides of the present 

invention or binding portions thereof. ,„,,„, 
Avirulence (ovr) genes (see Vivian, A., et al, Microbiology, 143:693- 

704 (1997); Leach, J. B, et al.. ArafePtaW^ 3 * 153 " 179 < 19% * 
J L "Bacterial Pathogenesis of Plants and Animals: Molecular and Cellular 
Mechanisms," in OurenlToja cs in Microbiology . and Immunolog y.. Dangl. J. L., ed. 
(Springer, Berlin), Vol. 192, pp. 99-1 18 (1994), which are hereby incorporated by 
reference) generate signals tha, trigger defense responses leading to disease resistance 
in ptants with corresponding resistance (K) genes. Typically, ovr genes are isolated 
by expressing a cosmid library from one pathogen in another pathogen and screenmg 
for narrowed host range, a* genes traditionally have been considered as negatrve 
determinants of host specificity a. the race-cultivar level, but some, including the avrE 
,oeus from the bacterial speck pathogen Pseudomonas syringae pathovar (pv.) tomato 
(Kobayashi, D. Y., e, al., tec^tL^ciJISA, 86:157-6! (1989), which is 
hereby incorporated by reference), restrict host range a. the pathovar-spec.es or 
„ species-species level (Whalen, M. C, e, a,., Sp^L^™. 35:6743-47 
0,88); Swarup, S., et a,., MolJPlanfcMjcroJjeJnteract, 5:204-13 (1992), whrch are 
hereby incorporated by reference). Many avr genes, including avrE, are Hrp 
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regulated. avrE and avrPphE (Mansfield, J., et al, Mol. Plant-M icrobe Interact 
7:726-39 (1994), which is hereby incorporated by reference) are physically linked to 
hrp genes. 

When expressed in trans, the avrE locus renderes P. syringae pv. 
glycinea, which causes bacterial blight of soybean, avirulent in all cultivars (Lorang, 
J. M., et al., Mol. Plant- Microbe Interact. . 8:49-57 (1995), which is hereby 
incorporated by reference). The locus comprises two convergent transcription units, 
one preceded by a putative ct 54 promoter and the other by a hrp box, a sequence found 
upstream of many hrp and avr genes that are positively regulated by the alternate 
sigma factor HrpL (Innes, R. W., et al., J. Bacteriol. . 175:4859-69 (1993); Shen, H., et 
al., J. BacteroL, 175:5916-24 (1993); Xiao, Y., et al., J. Bacteriol. . 176:3089-91 
(1994), which are hereby incorporated by reference). Expression of both transcripts 
require HrpL. The avrE locus contributes quantitatively to the virulence in tomato 
leaves of P. syringae pv. tomato strain PT23, but not of strain DC3000 (Lorang, J. M., 
15 et al - Mol. Plant-Microbe Interact.. 8:49-57 (1995); Lorang, J. M., et al., Mol. Plant- 
Microbe Interact. 7:508-515 (1994)). 

Thus, avr genes in plant pathogens bind to disease resistance genes in 
plants which are not susceptible to that pathogen. In view of the homology of the 
DNA molecules of the present invention to avr genes in plant pathogens, these DNA 
20 molecules can be used to identify corresponding plant disease resistance genes. Such 
identification is carried out by traditional plant breeding techniques in which a 
pathogen carrying the avr gene is inoculated to plants in screening to track inheritance 
or identify disruption of the resistance. Once identified, the resistance gene can be 
isolated by either of two approaches that have proved successful in recent years (see 
25 Staskawicz et al., Science , 68:661-67 (1995)). These are positional or map-based 

cloning and insertional mutagenesis or transposon tagging. Because there may be no 
DspE-insensitive cultivars (susceptible to Pseudomonas harboring dspE; each of four 
soybean cultivars tested responded to dspE), map-based cloning (which requires 
crosses between susceptible and resistant lines to identify the position of the 
resistance gene relative to other genes) may not be feasible. The preferred approach 
would more likely involve insertional mutagenesis, using the dspE gene or protein in 
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screens to identify lines which had lost the the product of dspE due to transposon 
tagging of the corresponding resistance gene. 

EXAMPLES 

Example 1 - Recombinant DNA techniques. 

Isolation of DNA, restriction enzyme digests, ligation, transformation 
of Escherichia coli, and construction of and colony hybridization to screen a 
P. syringae pv. tomato DC3000 genomic library were performed as described by 
Sambrook, et al. (Sambrook, J., et al., Molecular cloninp- A T ,aboratory manual , (Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY) (1989), which is hereby 
incorporated by reference). The library was constructed using P CPP47 (Bauer, D. W., 
et al ., Mnl Plant-Microbe Interact., 10:369-379 (1997), which is hereby incorporated 
by reference). Except where noted, E. coli DH5 and E. coli DH5a were used as hosts 
for DNA clones, and pBluescript or pBC plasmids (Stratagene, La Jolla, CA) were 
used as vectors. E. amylovora was transformed by electroporation as described 
(Bauer, D. W. in "Molecular Genetics of Pathogenicity of Erwinia amylovora: 
Techniques, Tools and Their Applications", (Ph. D. Thesis), Cornell University, 
Ithaca, NY (1990), which is hereby incorporated by reference). Plasmids were 
mobilized into E. amylovora and P. syringae using pRK2013 (Figurski, D., et al., 
Pro. N«tl Annd. Sci. USA 76: 1648-1652 (1979), which is hereby incorporated by 
reference). 

Example 2- Nucleotide sequencing and analysis. 

The nucleotide sequence of the dsp region of E. amylovora strain 
Ea321 was determined using sublcones of pCPP430 (Beer, S. V., et al., in Advances 
in Molecular Gene-tics of Plan t-Mi r.mhe. Interactions. Hennecke, H., et al., eds. 
(Kluwer Academic Publishers, Dordrecht, The Netherlands), pp. 53-60 (1991), which 
is hereby incorporated by reference). The nucleotide sequence of the avrE locus was 
determined using subclones of P CPP2357, a clone selected from a P. syringae 
pv. tomato DC3000 genomic cosmid library based on hybridization with the hrpRS 
operon of P. syringae pv. syringae, and the finding, based on partial sequencing, that 
it contained the avr£locus. Nucleotide sequencing was performed by the Cornell 
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Biotechnology Sequencing Facility on a Model 377 Sequencer (Perkin Elmer/ Applied 
Biosystems Division, Foster City, CA). Sequence assembly, analysis, and 
comparisons were performed using the programs of the GCG software package, 
version 7.1 (Genetics Computer Groups, Inc., Madison, WI) and DNASTAR 
5 (DNASTAR, Inc., Madison, WI). Database searches were performed using BLAST 
(Altschul, S. F., et al., Proc. Nat. Acad. Sci. USA , 87:5509-5513 (1990) which is 
hereby incorporated by reference). 

Example 3 - Expression of DspE and DspE' in E. colL 

10 The dspE operon was cloned in two pieces into pCPPSO, a derivative 

of PINIII 113 -A2 (Duffaud, G. D., et al. in Methods in Enzymology . Wu, R., et al., eds. 
(Academic Press, New York), 153:492-50 (1987), which is hereby incorporated by 
reference) with an expanded polylinker, yielding pCPP1259. Expression in 
pCPP1259 is driven by the Ipp promoter of E. coli, under the control of the lac 

15 operator. An intermediate clone, pCPP1244, extending from the start of the operon to 
the BamRl site in the middle of dspE, also was isolated. E. coli DH5a strains 
containing pCPP1259 and pCPP1244 were grown in LB at 37°C to an OD 620 of 0.3. 
Isopropylthio-p-D-galactoside then was added to 1 mM, and the cells further 
incubated until reaching an OD620 of 0.5. Cells were concentrated two-fold, lysed and 

20 subjected to SDS-PAGE as previously described (Sambrook, J., et al., Molecular 

cloning: A Laboratory Manual (Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY) (1989), which is hereby incorporated by reference), using 7.5% acrylamide. 
Cells containing pCPP50 were included for comparison. Proteins were visualized by 
Coomassie staining. 

25 

Example 4 - Deletion mutagenesis of dspE. 

1554 bp were deleted from the 5' Hindlll-BamUl fragment of dspE in 
pCPP1237 using unique Stul and Smal sites. The mutagenized clone then was 
inserted into the suicide vector pKNGlOl (Kaniga, K., et al., Gene , 109:137-42 
30 (1991), which is hereby incorporated by reference) using E. coli SMIOXp/r as a host, 
yielding pCPP1241. The mutation, designated A1554, then was transferred into 
E. amylovora strains using marker eviction as described previously (Bogdanove, A. J., 
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et al., J. Bacteriol. , 178:1720-30 (1996), which is hereby incorporated by reference). 
1521 bp were deleted from the 3' Hindlll fragment of dspE in pCPP1246 using two 
BstEll sites blunted with Klenow fragment. This mutation, A 1521, was transferred 
into E. amylovora strains as above. 

Example 5 - Pathogenicity assays. 

For E. amylovora strains, cell suspensions of 5 x 10 8 colony-forming 
units (cfu) per ml were pipetted into wells cut in immature Bartlett pear fruit, or 
stabbed into Jonamac apple and cotoneaster shoots, and assays carried out as 
described previously (Beer, S. V., in Methods in P hvtobacteriology, Klement, Z., et 
al., eds. (Adademiai Kiadoo, Budapest), pp. 373-374 (the "1990); Aldwinckle, H. S., 
et al., Phytopathology , 66: 1439-44 (1976), which are hereby incorporated by 
reference). For P. syringage pv. glycinea, panels of primary leaves of 2-week-old 
soybean seedlings {Glycine max, cultivar Norchief) were infiltrated with bacterial 
suspensions of 8 x 10 5 cfu/ml as for the HR assay, below. Plants were then covered 
with clear plastic bags and incubated under fluorescent lights (16 hr/day) at 22°C for 
5-7 days. Leaves were scored for necrosis and chlorosis. 

Example 6 - HR assays. 

Tobacco leaf panels (Nicotiana tabacum L. 'xanthi') were infiltrated 
with bacterial cell suspensions as described previously (Wei, Z. M., et al., Science , 
257:85-88 (1992); Bauer, D. W., et al., Mol. Plant-Microbe Interact. , 4:493-99 (1991), 
which are hereby incorporated by reference). Primary leaves of 2-week-old soybean 
seedlings (secondary leaves emerging) were infiltrated with bacterial cell suspensions 
as for tobacco. Plants were scored for HR (tissue collapse) after 24-48 hr on the 
laboratory bench. E. amylovora strains were suspended in 5 mM KP0 4 buffer, pH 
6.8, and P. syringae strains in 10 mM MgCl 2 . 

Example 7 - GUS assays. 

Cells were 1.) grown in LB to an OD 620 of 0.9-1.0; 2.) grown in LB to 
an OD 6 20 of 0.5, then washed and resuspended in a /zrp-gene-inducing minimal 
medium (Hrp MM; Huynh, T. V., et al., Science , 345:1374-77 (1989), which is 
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hereby incorporated by reference) to an OD 6 20 of 0.2 and incubated at 21° C for 36 hr 
to a final OD 62 o of 0.9-1 .0; or 3.) grown in LB to an OD 620 of 0.5, washed and 
concentrated 2-fold in 5 mM KP0 4 buffer, pH 6.8, and then transferred to freshly cut 
wells in pear halves and incubated as for the pathogenicity assay for 36 hr. Cells were 
assayed for p-glucuronidase (GUS) activity essentially according to Jefferson 
(Jefferson, R. A., Plant Molecular Biology Reporter . 5:387-405 (1987), which is 
hereby incorporated by reference). For the cells in LB or Hrp MM, 50 jxl were mixed 
with 200 ul GUS extraction buffer (50 mM NaHP0 4 , pH 7.0, 10 mM 
P-mercaptoethanol, 10 mM Na 2 EDTA, 0.1% sodium lauryl sarcosine, 0.1% Triton 
X-100) containing 2 mM 4-methylumbelliferyl p-D-glucuronide as substrate and 
incubated at 37° C for 100 min. For cells in pear fruit, the tissue surrounding the well 
was excised using a #4 cork borer and homogenized in 5 mM KP0 4 buffer, pH 6.8. 
200 (J.1 of homogenate was mixed with 800 ul of GUS extraction buffer with substrate 
and incubated as above. Reactions were stopped by adding Na 2 C0 3 to a final 
concentration of 0.2 M in a total volume of 2 ml. Fluorescence was measured using a 
TKO 100 Mini-Fluorometer (Hoefer Scientific Instruments, San Francisco, CA). For 
all samples, cell concentration was estimated by dilution plating, and fluorometric 
readings were converted to pmole of substrate hydrolyzed/10 8 cfu/min, after Miller 
(Miller, J. H., A Short Course in Bacterial Genetics: A Laboratory Ma nual and 
Handbook for Escherichia coli and Related Bacteria (Cold Spring Harbor Laboratory 
Press, Plainview, NY) (1992), which is hereby incorporated by reference). 

Example 8 - The "disease-specific" (dsp) region of E. amylovora consists of a 
6.6 kb, two-gene operon. 

Mapping of previous transposon insertions (Steinberger, E. M., et al., 
Mol. Plant-Microbe Interact. . 1:135-44 (1988), which is hereby incorporated by 
reference) that abolish pathogenicity but not HR-eliciting ability confirmed the 
presence of the "disease specific" (dsp) region downstream of the hrpN gene in strain 
Ea321 as reported in strain CFBP1430 (Barny, A. M., et al., Mol. Microbiol., 4:777- 
86 (1990), which is hereby incorporated by referece). The sequence of approximately 
15 kb of DNA downstream of hrpN from Ea321 was determined, revealing several 
open reading frames (ORFs' Fig. 1). One ORF, in an apparent 6.6 kb operon with a 
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smaller ORF, spanned the area to which the dsp insertions mapped. These two ORFs 
were designated and dspF^l&e^xorrrckpE. dspE is preceded (beginning 

70 bp upstream of the initiation codon) by the sequence GGAACCN, S CAACATAA 
(SEQ. ID. No. 5), which matches the HrpL-dependent promoter consensus sequence 
or "hrpbox" ofE. amylovora (Kim, J. H., et al., J. Bacteriol. . 179:1690-97 (1997); 
Kim, J. H., et al., J. Bacteriol. , 179:1690-97 (1997), which are hereby incorporated by 
reference) and strongly resembles the hrp box of P. syringae hrp and avr genes (Xiao, 
Y., et al., J. Bacteriol. , 176:3089-91 (1994), which is hereby incorporated by 
reference). Immediately downstream of dspF is A/T-rich DNA, followed by an ORF 
(ORF7) highly similar to the Salmonella typhimurium gene spvR, a member of the 
lysR family of regulatory genes (Caldwell, A. L. & Gulig, P. A., J. Bacteriol 
173:7176-85 (1991), which is hereby incorporated by reference). Immediately 
upstream of the dspE operon is a Hrp-regulated gene, hr P W, encoding a novel harpin. 

The deduced product of dspE contains 1 838 amino acid residues and is 
hydrophilic. The predicted molecular weight, 198 kD, was confirmed by expression 
in E. coli (Fig. 2). Expression of an intermediate clone containing only the 5' half of 
dspE yielded a protein of corresponding predicted mobility, suggesting that the 
N-terminal half of the protein might form an independently stable domain. DspF, 
predicted to be 16 kD, acidic (pi, 4.45), and predominantly a-helical, with 
amphipathic a helices in its C-terminus, is physically similar to virulence factor 
chaperones of animal-pathogenic bacteria (Wattiau, P., et al., Mol. Microbiol. 
20:255-62 (1996), which is hereby incorporated by reference). 

Example 9 - dspE is required for fire blight. 

Two in-frame deletions within dspE (Fig. 1 ) were made in Ea32 1 and 
Ea273 (low- and high-virulence strains, respectively). The first (A1554) corresponds 
to amino acid residues G 203 to G 720 , and the second (A 1521) to amino acid residues 
T,o64 to V I570 . Each deletion abolished the ability of both strains to cause fire blight 
when inoculated to immature pear fruit (Fig. 3), apple shoots, or cotoneaster shoots. 
A1554 was complemented by a clone carrying only the overlapping 5' half of dspE, 
further suggesting that the N-terminus of the protein forms a stable domain (Figs. 1 
and 3). 
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Example 10 - The dspE operon contributes quantitatively and in a 

strain-dependent fashion to HR elicitation by E. amylovora in 
tobacco and is not required for HR elicitation by E. amylovora in 
soybean. 

Transposon insertions in the dsp region reduce the ability of 
E. amylovora to elicit the HR in tobacco (Barny, A. M., et al., Mol. Microbiol.. 4:777- 
86 (1990), which is hereby incorporated by reference). Dilution series of suspensions 
of dspEA\554 mutant strains of Ea321 and Ea273 were infiltrated into tobacco leaves 
alongside their wild-type parents to assess the role of dspE in HR elicitation (Fig. 3). 
All strains were capable of eliciting the HR, but Ea321 <&p£A1554, on a per-cell 
basis, was roughly one-tenth as effective as the wild-type in eliciting tissue collapse. 
There was no noticeable difference in HR-eliciting activity, however, between Ea273 
and Ea273<ftp£A1554. Ea321dsp£A1554 elicited wild-type HR in Acme, Centennial, 
Harasoy, and Norchief soybean leaves (Fig. 3). 

Example 11 - The dspE operon is Hrp-regulated. 

A promoterless uidA gene construct was cloned downstream of the 
dspE fragment in pCPP1241 that was used to introduce the A1554 mutation (Fig. 1) 
into wild-type strains of E. amylovora (this construct consists of a 3 '-truncated dspE 
gene with the internal deletion). The resulting plasmid, pCPP1263, was mobilized 
into Ea321 and Ea273. Pathogenic strains, in which plasmid integration had 
preserved an intact copy oidspE, and non-pathogenic strains, in which the native 
copy of dspE had been mutated, were isolated. All strains were assayed for GUS 
activity in Luria Bertani medium (LB) and in Hrp MM, and pathogenic strains were 
assayed for activity in pear fruit. High levels of activity were obtained from strains 
incubated in Hrp MM and pear, but not LB. The level of expression in Hrp MM was 
equivalent to that of a hrcV-uidA fusion ("G73", Wei, et al., J. Bacteriol., 177:6201-10 
(1995), which is hereby incorporated by reference) used as a positive control. There 
were no significant differences in levels of expression of the dspE-uidA fusion in the 
wild-type and dspE mutant backgrounds, indicating that dspE likely is not 
autoregulated. Expression of the dspE-uidA fusion in hrpL mutants of Ea321 and 
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Ea273 in hrp MM was two orders of magnitude lower than that in HrpL + strains. 
Data for Ea273 and derivatives are shown in Fig. 4. 

Example 12 - dspE and dspF are homologous with genes in the avrE locus of 
Pseudomonas syringage pv. tomato. 

A BLAST (Altschul, S. F., et al., J. Mol. Biol, 215:403-10 (1990), 
which is hereby incorporated by reference) search of the genetic databases revealed 
similarity of dspE to a partial sequence of the avrE locus of P. syringae pv. tomato 
(Lorang, J. M, et al., ^ Pi^t-Microhe Interact, 8:49-57 (1995), which is hereby 
incorporated by reference). A cosmid library of P. syringae pv. tomato DC3000 
genomic DNA was constructed, and a clone overlapping the hrp gene cluster and 
containing the avrE locus was isolated (pCPP2357). The complete nucleotide 
sequence of the avrE locus was determined, revealing the homolog of dspE (encoding 
a 195 kD, 1795 amino acid protein of 30% identity) alone in an operon previously 
designated transcription unit III, and a homolog of dspF (encoding a 14 kD, a 129 
amino acid protein of 43% identity) at the end of the juxtaposed and opposing operon 
previously designated transcription unit IV (Fig. 1). These genes are designated avrE 
and.ovrF. The C-terminal half of the DspE and AvrE alignment (from V 845 of DspE) 
shows greater conservation (33% identity) than the N-terminal half (26% identity). 
AvrE contains a motif (aa residues A4 50 to T 457 ) conserved in ATP- or GTP-binding 
proteins ("P-loop"; Saraste, M, et al., Trends Biochem. Sci., 15:430-34 (1990), which 
is hereby incorporated by reference). This motif is not conserved in DspE, however, 
and its functional significance in AvrE, if any, is unclear. Amino acid identities are 
distributed equally throughout the DspF and AvrF alignment, and AvrF^ares the 
predicted physical characteristics of DspF. Upstream of avr^ee^petog the operon, 
is a 2.5 kb gene with no similarity to sequences in the genetic databases. 

Example 13 - The dspE operon functions as an avirulence locus. 

The dspE operon was cloned into pML 122 (Labes, M., et al., Gene, 
89:37.46 (1990), which is hereby incorporated by reference) downstream of the nptll 
promoter, and this construct, pCPP1250, was mobilized into P. syringae pv. glycinea 
race 4. The resulting strain, but not a control strain containing pML 122, elicited the 
HR in soybean cultivars Acme, Centennial, Harasoy, andNorchief; in Norchief plants 



# 



-41 - 

incubated under conducive conditions, race 4 harboring pCPP1250 failed to cause 
symptoms of disease, while the control strain caused necrosis and chlorosis that 
spread from the point of inoculation (Fig. 5). 

5 Example 14 - avrE complements dspE mutations. 

Cosmid pCPP2357 was mobilized into Ea321 dspE mutant strains 
A1554 and A1521. The resulting transconjugants were pathogenic but low in 
virulence. Ea321<&p£A1521 carrying pCPP2357 with a transposon insertion in the 
avrE gene was non-pathogenic, demonstrating that the complementation observed 

10 was avris-specific (Figs. 1 and 5). The same results were observed for 
transconjugants of the Ea273dspEAl521 mutant. 

Over thirty bacterial avr genes have been discovered. The plethora of 
avr genes is thought to result from an "evolutionary tug-of-war" (Dangl, J. L., in 
Bacterial Pathogenesis of Plants and Animals: Molecular and Cellular Mechanisms 

15 f Current Topics in Microbiology and Immunology) , Dangl. J. L., ed. (Springer, 

Berlin), 192:99-118 (1994), which is hereby incorporated by reference), a reiterative 
process of selection, counterselection due to R genes, and modification or substitution 
of avr genes that was originally discerned by Flor, who hypothesized that "during 
their parallel evolution host and parasite developed complementary genie systems" 

20 (Flor, H. H., Adv. Genet. , 8:29-54 (1956), which is hereby incorporated by reference). 
However, only a few avr genes (including avrE in strain PT23) play detectable roles 
in virulence or pathogen fitness in their native genetic background (Lorang, J. M., et 
al., MoL Plant-Microbe Interact . 7:508-15 (1994); Kearney, B., et al., Nature , 
346:385-86 (1990); Swarup, S., et al., Phytopathology , 81:802-808 (1991); De Feyter, 

25 R. D., et al., Mol. Plant-Microbe Interact. , 6:225-37 (1993); Ritter, C, et ah, MoL 
Plant-Microbe Interact. , 8:444-53 (1995), which are hereby incorporated by 
reference), and the selective force driving the maintenance in pathogen genomes of 
many of these host-range-limiting factors has remained a mystery. It is now clear, 
though, that several Avr proteins are delivered into plant cells by the Hrp pathway 

30 (Gopalan, S., et al., Plant Celli, 8:1095-1 105 (1996); Tang, X., et al., Science , 

274:2060-63 (1996); Scofield, S. R., et al., Science , 274:2063-65 (1996); Leister, R. 
T., et al., Proc. Natl. Acad. Sci. USA , 93:15497-15502 (1996); Van Den Ackerveken, 
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G., et al., Cell, 87:1307-16 (1996), which are hereby incorporated by reference) and, 
therefore, are likely to be fundamentally virulence factors, which interact (directly, or 
indirectly through enzymatic products) with host targets to promote parasitism. 
Mutation of such targets (selected because of reduced susceptibility) as well as the 
evolution of R proteins that recognize the Avr proteins would force the acquisition or 
evolution of new or modified Avr proteins and result in the proliferation of avr genes. 
Cumulatively, these co-evolutionary processes likely would drive a trend toward avr 
genes with quantitative and redundant effects in pathogenesis rather than critically 
important roles (Alfano, J. R., et al, Plant Cell , 8:1683-16988 (1996), which is hereby 

incorporated by reference). 

It has been found that the homologs dspE and avrE contribute to 
disease to dramatically different extents. The avirulence locus can substitute 
transgenerically for the pathogenicity operon, and that the avirulence function of dspE 
extends across pathogen genera as well. These findings support the hypothesis that 
avr genes have a primary function in disease. Moreover, they support and expand the 
coevolutionary model for avr gene proliferation discussed above, and they have 
practical implications concerning the control of fire blight and other bacterial diseases 
of perennials. 

One can predict from the model that the relative contribution to 
pathogenicity of a particular factor would reflect, in part, the genetic history of the 
pathogen, specifically, the degree of co-evolution with its host(s). dspE is required 
for pathogenicity; avrE has a quantitative, strain-dependent, virulence phenotype. 
Consistent with the prediction, evolution of corresponding R genes and modification 
of targets of pathogen virulence factors is likely to have occurred more often and to a 
greater extent over time in the herbaceous hosts typically infects by P. syringae 
pathovars than in the woody hosts with which E. amylovora presumably evolved. 
Alternatively or additionally, acquisition of dspE (through evolution or horizontal 
transfer) by E. amylovora could have occurred relatively more recently than 
acquisition of avrE by P. syringae, allowing less time for coevolution leading to 
modification or the development of redundant function. 

One could also hypothesize from the model that virulence factors may 
be conserved among pathogens, yet individually adapted to avoid detection on a 
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particular host. Preliminary results from Southern blot hybridizations suggest that 
P. syringae pv. glycinea harbors an avrE homolog, which, if functional, would 
support such a hypothesis. Similarly, homologs of the soybean cultivar-specific genes 
avr A and avrD from P. syringae pv. tomato exist in P. syringae pv. glycinea 
(Kobayashi, D. Y., et al., Pror. Natl Acad. Sci. USA, 86:157-161 (1989), which is 
hereby incorporated by reference). 

The homology and abilities of dspE and avrE to function 
transgenerically expand the model for avr gene proliferation. Major components of 
an evolution toward multifactor virulence could be procurement of genes encoding 
novel virulence factors from heterologus pathogens, and conservation of a 
functionally cosmopolitan virulence factor delivery system (and possibly conservation 
of a universal Hrp-pathway-targeting signal on the factors themselves) that would 
enable their deployment. Indeed, many avr genes are on plasmids and scattered in 
their distribution among pathogen strains (Dangl, J. L., in Bacterial Pathogenesis of 
P w. „nd Anir r»i«- Molecular an d Cellular M echanisms (Current Topics in 
^^K;»in pv ,nd Tmmunologvl , Dangl. J. L., ed. (Springer, Berlin), 192:99-118 
(1994), which is hereby incorporated by reference), and individual hp genes are 
conserved and even interchangeable (Arlat, M., et al., Mol Plant-Microbe Interact. , 
4:593-601 (1991); Laby, R. J., et al., Mol Plant-Microbe Interact., 5:412-19 (1992), 
which is hereby incorporated by reference). The presence of dspE and avrE in 
distinct genera suggests horizontal transfer of an ancestral locus, and, although dspE 
and avrE are homologous and ^-linked, the transgeneric function of these genes 
suggests that the Hrp pathways in E. amylovora and P. syringae have remained 
insensitive to differences accrued in DspE and AvrE over evolution. It is predicted 
that even non-homologous Avr-like proteins will function across phytopathogenic 
bacterial genera. 

It remains to be shown whether the avirulence function of the dspE 
locus is Hrp-pathway-dependent. This seems likely, and it will be important to 
determine the localization of the dspE and dspF gene products in the plant-bacterial 
interaction. The physical similarity of DspF (and AvrF) to chaperones required for 
type III secretion of virulence factors from animal-pathogenic bacteria (Wattiau, P., et 
al., Mol. Microbiol., 20:255-62 (1996), which is hereby incorporated by reference) is 
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intriguing and novel in phytopathogenic bacteria. The requirement of these 
chaperones appears to be due to a role other than targeting to the secretion pathway 
(Woestyn, S., et al., Mol^icrobioL, 20:1261-71 (1996), which is hereby 
mcorporated by reference): chaperones may stabilize proteins, maintain proteins in an 
approbate conformation for secretion, or prevent premature polymerization or 
associate with other proteins. Perhaps, DspF binds to DspE (and AvrF to AvrE) and 
Plays a similar role, which might be particularly important for the latter protein due to 
its large size and probable multidomain nature. 

The dspE operon is the first described avirulence locus in 
E. amylovora. A homolog of avrRxv from Xanthomas campestris (Whalen M C 
et al., ^m^^JMA, 85:6743-47 (1988), which is hereby incorporated" 
by reference) has been found near the dspE operon (Kim, J. F., in Molecular 
Char acterization of a N ovelHarr^n^^ 

^^^^ (Ph.D. Thesis), Cornell University 

Ithaca, NY (1997)). Monogenic (^-gene-mediated) resistance to fire blight has not ' 
been reported, but differential virulence of E. amyolovora strains on apple cultivars 
has been observed (Norelli, J. L., et al., Phyip^to, 74:136-39 (1984) which is 
hereby incorporated by reference). Also, some strains of E. amylovora infect Rutus 
spp. and not pomaceous plants, and vice-versa (Starr, M. P., et al., Pim^athplogy 
41:915-19 (1951), which is hereby incorporated by reference). Whether the dspE 
operon and the avrRxv homolog or other potential elicitors play a role in these 
specificities should be determined. 

Although the dspE operon triggers defense responses in soybean when 
expressed m P. syringae pv. glycinea, it is not required for the HR of soybean elicited 
by E. amylovora. Neither is /^required (Fig. 3). It is possible that E amylovora 
must have one or the other, dspE or hrpN, to elicit the HR in soybean It has been 
observed, however, that purified harpin does not elicit the HR in soybean, suggesting 
the alternative explanation that E. amylovora harbors another avr gene recognized by 
this plant. 

Recognition of E. amylovora avirulence signals in soybean indicates 
the presence of one or more R genes that might be useful for engineering fire blight 
resistant apple and pear tree, ^-gene-mediated resistance to the apple scab pathogen 
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Venturia inaequalis (Williams, E. B., et al, Ann. Rev, Phvtopathol.. 7:223-46 (1969), 
which is hereby incorporated by reference) and successful transformation of apple 
with attacin E for control of fire blight (Norelli, J. L., et al, Eu phytica , 77:123-28 
(1994), which is hereby incorporated by reference) attest the feasibility of such an 
approach. R gene-mediated resistance to apple scab has been overcome in the field 
(Parisi, L., et ah, Phytopathology , 83:533-37 (1993), which is hereby incorporated by 
reference), but the requirement for dspE in disease favors relative durabiliity of a 
(&p£-specific R gene (Kearney, B. et al., Nature , 346:385-86 (1990), which is hereby 
incorporated by reference). Avirulence screening of dspE and other E. amylovora 
genes in pathogens of genetically tractable plants such as Arabidopsis could broaden 
the pool of candidate R genes and hasten their isolation. A similar approach could be 
used to isolate R genes effective against other diseases of woody plants. Furthermore, 
if the dspE operon is as widely conserved as is suggested by its homology with the 
avrE locus, a corresponding R gene could be effective against a variety of pathogens 
both of woody and herbaceous plants. 

Native (non-denatured) DspE protein has not been produced in 
sufficient quantity to test its ability to elicit the HR (i.e. hypersensitive response) in a 
manner similar to hypersensitive response elicitors (i.e., by exogenous application). 
Therefore, no one has shown that dspE of E. amylovora elicits the HR when applied 
to plants as an isolated cell-free material. However, when the gene encoding the 
protein is transferred to another bacterium (along with the smaller dspF gene), e.g., 
Pseudomonas syringae, which ordinarily causes disease on certain plants, the 
recipient bacterium no longer causes disease but instead elicits the HR. The 
mechanism for this is not known for sure, but it is suspected to involve (and there is 
compelling evidence for) a mechanism in which the bacterial cell actually injects the 
DspE protein into the living plant cell, triggering the development of plant cell 
collapse (i.e. HR). Presumably, when the DspE protein is in the living plant cell, it 
might signal the plant to develop resistance to insects and pathogens. 

Based on the similarity of the predicted physical characteristics of 
DspF to those of known chaperone proteins from animal pathogens, it is believed that 
this rather small protein is a chaperone of DspE. Chaperones in animal pathogens 
bind in the cytoplasm to specific proteins to be secreted. They seem to be required for 
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secretion of the proteins but are not themselves secreted. Evidence suggests that the 
chaperones are not involved directly in targeting the secreted proteins to the secretion 
apparatus. Instead, they may act to stabilize the proteins in the cytoplasm and/or 
prevent their premature aggregation or association with other proteins (e.g., bacterial 
5 proteins that direct transport through the host cell-membrane). 

The dspE gene bears no similarity to known genes except avrE. 
Enzymatic function (i.e., one resulting in the production of a secondary molecule that 
elicits the HR) of DspE cannot be ruled out at present. In fact, one avr gene product is 
known to elicit HR indirectly by catalyzing synthesis of a diffusible elicitor molecule. 
10 However, the simplest explanation for the observed HR eliciting function of the dspE 
operon expressed in Pseudomonas species is that the protein encoded by the dspE 
gene is secreted from the bacterium and possibly transported into the plant cell, that 
there it triggers directly plant defense responses leading to the HR, and that this 
process is mediated by a specific resistance gene product that recognizes (acts as a 
1 5 receptor of) the DspE protein. Indeed, four avr genes that depend on the Hrp 

secretory apparatus to function when expressed in bacteria have been shown to cause 
HR when expressed transgenically within plant cells. One of these has been shown to 
encode a protein that directly interacts with the product of its corresponding resistance 
gene. Ultimately, whether DspE elicits plant defense responses from outside or inside 
20 the plant cell, directly or through a secondary molecule, must be determined in order 
to define practical applications of this protein and its encoding gene as a plant defense 
elicitor. 

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations 
25 can be made therein by those skilled in the art without departing from the spirit and 
scope of the invention which is defined by the following claims. 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ATGGAATTAA AATCACTGGG AACTGAACAC AAGGCGGCAG TACACACAGC GGCGCACAAC 
CCTGTGGGGC ATGGTGTTGC CTTACAGCAG GGCAGCAGCA GCAGCAGCCC GCAAAATGCC 
GCTGCATCAT TGGCGGCAGA AGGCAAAAAT CGTGGGAAAA TG CCGAGAAT TCACCAGCCA 
TCTACTGCGG CTGATGGTAT CAGCGCTGCT CACCAGCAAA AGAAATCCTT CAGTCTCAGG 
GGCTGTTTGG GGACGAAAAA ATTTTCCAGA TCGGCACCGC AGGGCCAGCC AGGTACCACC 
CACAG CAAAG GGGCAACATT GCGCGATCTG CTGGCGCGGG ACGACGGCGA AACGCAGCAT 
GAGGCGGCCG CGCCAGATGC GGCGCGTTTG ACCCGTTCGG GCGGCGTCAA ACGCCGCAAT 
ATGGACGACA TGGCCGGGCG GCCAATGGTG AAAGGTGGCA GCGGCGAAGA TAAGGTACCA 
ACGCAGCAAA AACGGCATCA GCTGAACAAT TTTGGCCAGA TGCGCCAAAC GATGTTGAGC 
AAAATGGCTC ACCCGGCTTC AGCCAACGCC GGCGATCGCC TGCAGCATTC ACCGCCGCAC 
ATCCCGGGTA GCCACCACGA AATCAAGGAA GAACCGGTTG GCTCCACCAG CAAGGCAACA 
ACGGCCCACG CAGACAGAGT GGAAATCGCT CAGGAAGATG ACGACAGCGA ATTCCAGCAA 
CTGCATCAAC AGCGGCTGGC GCGCGAACGG GAAAATCCAC CGCAGCCGCC CAAACTCGGC 
GTTGCCACAC CGATTAGCGC CAGGTTTCAG CCCAAACTGA CTGCGGTTGC GGAAAGCGTC 
CTTGAGGGGA CAGATACCAC GCAGTCACCC CTTAAGCCGC AATCAATGCT GAAAGGAAGT 
GGAG CCGGGG TAACGCCGCT GGCGGTAACG CTGGATAAAG GCAAGTTGCA GCTGGCACCG 
GATAATCCAC CCGCGCTCAA TACGTTGTTG AAGCAGACAT TGGGTAAAGA CACCCAGCAC 
TATCTGGCGC ACCATGCCAG CAGCGACGGT AGCCAGCATC TGCTGCTGGA CAACAAAGGC 
CACCTGTTTG ATATCAAAAG CACCGCCACC AGCTATAGCG TGCTGCACAA CAGCCACCCC 
GGTGAGATAA AGGGCAAGCT GGCGCAGGCG GGTACTGGCT CCGTCAGCGT AGACGGTAAA 
AGCGGCAAGA TCTCGCTGGG GAGCGGTACG CAAAGTCACA ACAAAACAAT GCTAAGCCAA 
CCGGGGGAAG CGCACCGTTC CTTATTAACC GGCATTTGGC AGCATCCTGC TGGCGCAGCG 
CGGCCGCAGG GCGAGTCAAT CCGCCTGCAT GACGACAAAA TTCATATCCT GCATCCGGAG 
CTGGGCGTAT GGCAATCTGC GGATAAAGAT ACCCACAGCC AGCTGTCTCG CCAGGCAGAC 
GGTAAGCTCT ATGCGCTGAA AGACAACCGT ACCCTGCAAA ACCTCTCCGA TAATAAATCC 
TCAG AAAAG C TGGTCGATAA AATCAAATCG TATT CCGTTG ATCAGCGGGG GCAGGTGGCG 
ATCCTGACGG AT ACT CCCGG CCGCCATAAG ATGAGTATTA TGCCCTCGCT GGATGCTTCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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C CG GAG AG C C ATATTTCCCT CAGCCTGCAT TTTGCCGATG CCCACCAGGG GTTATTGCAC 16 8 0 

GGGAAGTCGG AGCTTGAGGC AGAATCTGTC GCGATCAGCC ATGGG CGACT GGTTGTGGCC 174 0 

GAT AG CG AAG GCAAGCTGTT TAGCGCCGCC ATTCCGAAGC AAGGGGATGG AAACGAACTG 180 0 

AAAATGAAAG CCATGCCTCA GCATGCGCTC GATGAACATT TTGGTCATGA CCACCAGATT I8 60 

TCTGGATTTT TCCATGACGA CCACGGCCAG CTTAATG CG C TGGTGAAAAA TAACTTCAGG 192 0 

CAGCAGCATG CCTGCCCGTT GGGTAACGAT CATCAGTTTC ACCCCGGCTG GAACCTGACT 198 0 

GATGCGCTGG TTAT CGACAA TCAG CTGGGG CTGCATCATA CCAATCCTGA ACCG CATGAG 2 04 0 

ATT CT TG ATA TGGGG CATTT AGGCAGCCTG GCGTTACAGG AGGGCAAGCT TCACTATTTT 2100 

GACCAGCTGA CCAAAGGGTG GACTGG CG CG GAGTCAGATT GT AAG GAG CT GAAAAAAGGC 2160 

CTGGATGGAG CAGCTTATCT ACTGAAAGAC GG TG AAG TG A AACGCCTGAA TATTAATCAG 222 0 

AGCACCTCCT CTATCAAGCA CGGAACGGAA AACGTTTTTT CGCTGCCGCA TGTGCGCAAT 22 80 

AAACCGGAGC CGGGAGATGC CCTGCAAGGG CTGAATAAAG ACGATAAGGC CCAGGCCATG 23 4 0 

GCGGTGATTG GGG TAAATAA ATACCTGGCG CTGACGGAAA AAGGGGACAT TCGCTCCTTC 24 0 0 

CAGATAAAAC CCGGCACCCA GCAGTTGGAG CGGCCGGCAC AAACTCTCAG CCGCGAAGGT 24 60 

ATCAGCGGCG AACTGAAAGA CATTCATGTC GACCACAAGC AGAACCTGTA TGCCTTGACC 2 52 0 

CACGAGGGAG AGGTGTTTCA TCAGCCGCGT GAAGCCTGGC AGAATGGTGC CGAAAGCAGC 2 58 0 

AGCTGGCACA AACTGGCGTT GCCACAGAGT GAAAGT AAG C TAAAAAGTCT GGACATGAGC 2 64 0 

CATGAGCACA AACCGATTGC CACCTTTGAA GACGGT AG C C AG CATCAG CT GAAGGCTGGC 2 70 0 

GGCTGGCACG CCTATGCGGC ACCTGAACGC GGGCCGCTGG CGGTGGGTAC CAGCGGTTCA 2 760 

CAAACCGTCT TTAAC CGACT AATG CAGGGG GTGAAAGGCA AGGTGATCCC AGGCAGCGGG 2 82 0 

TTGACGGTTA AGCTCTCGGC TCAGACGGGG GG AATG AC CG GCGCCGAAGG GCGCAAGGTC 2880 

AG CAGTAAAT TTTCCGAAAG GATCCGCGCC TATGCGTTCA ACCCAACAAT GTCCACGCCG 2 94 0 

CGACCGATTA AAAATGCTGC TTATGCCACA CAGCACGGCT GG CAGGGG CG TGAGGGGTTG 3 00 0 

AAGCCGTTGT ACGAGATG CA GGG AG CG CTG ATTAAACAAC TGGATGCGCA TAACGTTCGT 3 060 

CATAACG CG C CACAGCCAGA TTTGCAGAGC AAACTGGAAA CTCTGGATTT AGGCGAACAT 312 0 

GGCGCAGAAT TG CTTAACGA CATGAAGCGC TTCCGCGACG AACTGGAGCA GAGTGCAACC 3180 

CGTTCGGTGA CCGTTTTAGG TCAACATCAG GG AG TG C T AA AAAGCAACGG TGAAATCAAT 324 0 

AG CG AATTTA AGCCATCGCC CGGCAAGGCG TTGGTCCAGA GCTTTAACGT CAATCGCTCT 33 00 

GGTCAGGATC TAAGCAAGTC ACTGCAACAG GCAGTACATG CCACGCCGCC AT CCG CAGAG 3 3 60 

AGTAAACTGC AATCCATGCT GGGGCACTTT GTCAGTGCCG GGGTGGATAT GAGTCATCAG 3420 

AAGGGCGAGA TCCCGCTGGG CCGCCAGCGC GAT C CG AATG AT AAAAC CG C ACTGAC CAAA 34 8 0 



3660 
3720 
3780 
3840 
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TCGCGTTTAA TTTTAGATAC CGTGACCATC GGTGAACTGC ATGAACTGGC CGATAAGGCG 3 54 0 

AAACTGGTAT CTGACCATAA ACCCGATGCC GAT GAG AT AA AACAGCTGCG CCAGCAGTTC 3600 
GATACGCTGC GTGAAAAGCG GTATGAGAGC AATCCGGTGA AGCATTACAC CGATATGGGC 
T TCAC C CAT A ATAAGGCGCT GGAAGCAAAC TATGATGCGG TCAAAGCCTT TATCAATGCC 
TTTAAGAAAG AGCACCACGG CGTCAATCTG ACCACGCGTA CCGTACTGGA ATCACAGGGC 
AGTGCGGAGC TGG CGAAGAA GCTCAAGAAT ACGCTGTTGT CCCTGGACAG TGGTGAAAGT 
ATGAGCTTCA GCCGGTCATA TGGCGGGGGC GTCAGCACTG TCTTTGTGCC TACCCTTAGC 3 9 00 
AAGAAGGTGC CAGTTCCGGT GATCCCCGGA GCCGGCATCA CGCTGGATCG CGCCTATAAC 3 96 0 
CTGAGCTTCA GTCG TACCAG CGGCGGATTG AACGTCAGTT TTGGCCGCGA CGGCGGGGTG 4 02 0 

AGTGGTAACA TCATGGTCGC TACCGGCCAT GATGTGATGC CCTATATGAC CGG TAAGAAA 4 08 0 

ACCAGTGCAG GTAACGCCAG TGACTGGTTG AG CG CAAAAC ATAAAATCAG CCCGGACTTG 414 0 

CGTATCGGCG CTGCTGTGAG TGGCACCCTG CAAGGAACGC TACAAAACAG CCTGAAGTTT 4 20 0 

AAGCTGACAG AGGATGAGCT GCCTGGCTTT ATCCATGGCT TGACGCATGG CACGTTGACC 4 26 0 

CCGGCAGAAC TGTTGCAAAA GGGGATCGAA CAT GAG ATGA AG CAGGG C AG CAAACTGACG 432 0 

TTTAGCGTCG ATACCTCGGC AAATCTGGAT CTGCGTGCCG GTATCAATCT GAACGAAGAC 43 80 

GGCAGTAAAC CAAATGGTGT CACTGCCCGT GTTTCTGCCG GGCTAAGTGC ATCGGCAAAC 44 4 0 

CTGGCCGCCG GCTCGCGTGA ACGCAGCACC ACCTCTGGCC AGTTTGGCAG CACGACTTCG 4500 

GCCAGCAATA ACCGCCCAAC CTTCCTCAAC GGGGTCGGCG CGGGTGCTAA CCTGACGGCT 4 56 0 

GCTTTAGGGG TTGCCCATTC ATCTACGCAT GAAGGGAAAC CGGTCGGGAT CTTCCCGGCA 4 62 0 

TTTACCTCGA CCAATGTTTC GGCAGCGCTG GCGCTGGATA ACCGTACCTC ACAGAGTATC 46 8 0 

AGCCTGGAAT TGAAGCGCGC GG AG C CGG TG ACCAGCAACG ATATCAGCGA GTTGACCTCC 4 74 0 

ACGCTGGGAA AACACTTTAA GGATAGCGCC ACAACGAAGA TGCTTGCCGC TCTCAAAGAG 4 800 

TTAGATGACG CTAAGCCCGC TGAACAACTG CAT ATT T TAC AGCAGCATTT CAGTGCAAAA 4 860 

GATGTCGTCG GTGATGAACG CTACGAGG CG GTGCGCAACC TGAAAAAACT GGTGATACGT 4 92 0 

CAACAGG CTG CGGACAGCCA CAG CATGGAA TTAGGATCTG CCAGTCACAG CACGACCTAC 4980 

AATAATCTGT CGAGAATAAA TAATGACGGC ATTGTCGAGC TGCTACACAA ACATTTCGAT 504 0 

GCGGCATTAC CAGCAAGCAG TG CCAAACGT CTTGGTGAAA TGATGAATAA CGATCCGGCA 5100 

CTGAAAGATA TTATTAAGCA GCTGCAAAGT ACGCCGTTCA GCAGCGCCAG CGTGTCGATG 5160 

GAG CTG AAAG ATGGTCTGCG TGAGCAGACG GAAAAAGCAA TACTGGACGG TAAGGT CGG T 52 2 0 

CGTGAAGAAG TGGGAGTACT TTTCCAGGAT CGTAACAACT TGCGTGTTAA ATCGGTCAGC 52 8 0 

GTCAGTCAGT CCGTCAGCAA AAGCGAAGGC TTCAATACCC CAGCGCTGTT ACTGGGGACG 53 4 0 



5517 
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AGCAACAGCG CTGCTATGAG CATGGAGCGC AACATCGGAA CCATTAATTT TAAATACGGC 5400 
CAGGATCAGA ACACCCCACG GCGATTTACC CTGGAGGGTG GAATAGCTCA GGCTAATCCG 5460 
CAGGTCGCAT CTGCGCTTAC TGATTTGAAG AAGGAAGGG C TGGAAATGAA GAGCTAA 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 3 8 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met 
1 



Glu Leu Lys Ser Leu Gly Thr Glu His Lys Ala Ala Val His Thr 
5 10 15 



Ala Ala His Asn Pro Val Gly His Gly Val Ala Leu Gin Gin Gly Ser 
20 25 30 

Ser Ser Ser Ser Pro Gin Asn Ala Ala Ala Ser Leu Ala Ala Glu Gly 
35 40 45 

Lys Asn Arg Gly Lys Met Pro Arg He His Gin Pro Ser Thr Ala Ala 

55 60 



50 

Asp Gly He Ser Ala 
65 



Ala His Gin Gin Lys Lys Ser Phe Ser Leu Arg 
70 75 80 



Gly Cys Leu Gly Thr Lys Lys Phe Ser Arg Ser Ala Pro Gin Gly Gin 
85 90 95 

Pro Gly Thr Thr His Ser Lys Gly Ala Thr Leu Arg Asp Leu Leu Ala 
100 105 HO 

Arg Asp Asp Gly Glu Thr Gin His Glu Ala Ala Ala Pro Asp Ala Ala 
115 I 20 125 

Ara Leu Thr Arg Ser Gly Gly Val Lys Arg Arg Asn Met Asp Asp Met 
■ 130 135 1« 

Ala Gly Arg Pro Met Val Lys Gly Gly Ser Gly Glu Asp Lys Val Pro 
145 150 ' 155 160 

Thr Gin Gin Lys Arg His Gin Leu Asn Asn Phe Gly Gin Met Arg Gin 
165 I 70 175 

Thr Met Leu Ser Lys Met Ala His Pro Ala Ser Ala Asn Ala Gly Asp 
180 185 190 

Arq Leu Gin His Ser Pro Pro His He Pro Gly Ser His His Glu He 
195 200 205 



52 



Lys Glu Glu Pro Val Gly Ser Thr Ser Lys Ala Thr Thr Ala His Ala 

210 215 2 20 

Asp Arg Val Glu He Ala Gin Glu Asp Asp Asp Ser Glu Phe Gin Gin 

225 230 235 240 



Leu His Gin Gin Arg Leu Ala Arg Glu Arg Glu Asn Pro Pro Gin Pro 
245 2 5o 255 

Pro Lys Leu Gly Val Ala Thr Pro lie Ser Ala Arg Phe Gin Pro Lvs 
260 265 270 

Leu Thr Ala Val Ala Glu Ser Val Leu Glu Gly Thr Asp Thr Thr Gin 
275 280 



285 



Ser Pro Leu Lys Pro Gin Ser Met Leu Lys Gly Ser Gly 
290 295 300 



Ala Gly Val 



Thr Pro Leu Ala Val Thr Leu Asp Lys Gly Lys Leu Gin Leu Ala Pro 
305 310 315 320 

Asp Asn Pro Pro Ala Leu Asn Thr Leu Leu Lys Gin Thr Leu Gly Lys 
325 330 335 

Asp Thr Gin His Tyr Leu Ala His His Ala Ser Ser Asp Gly Ser Gin 
340 345 3 5 o 

His Leu Leu Leu Asp Asn Lys Gly His Leu Phe Asp He Lys Ser Thr 
355 360 365 

370 ^ G1U Ile LYS 



375 



380 



Gly Lys Leu Ala Gin Ala Gly Thr Gly Ser Val Ser Val Asp Gly Lv, 
385 390 395 



s 

400 



Ser Gly Lys Ile Ser Leu Gly Ser Gly Thr Gin Ser His Asn Lys Thr 
405 410 



415 



Met Leu Ser Gin Pro Gly Glu Ala His Arg Ser Leu Leu Thr Gly He 

420 425 430 

Trp Gin His Pro Ala Gly Ala Ala Arg Pro Gin Gly Glu Ser Ile Arg... 

435 440 445 

Leu His Asp Asp Lys Ile His Ile Leu His Pro Glu Leu Gly Val Trp 

450 455 460 

Gin Ser Ala Asp Lys Asp Thr His Ser Gin Leu 



465 



470 



Ser Arg Gin Ala Asp 
475 480 

Gly Lys Leu Tyr Ala Leu Lys Asp Asn Arg Thr Leu Gin Asn Leu Ser 
485 490 4g5 

Asp Asn Lys Ser Ser Glu Lys Leu Val Asp Lys He Lys Ser Tyr Ser 



500 



505 



510 



Val Asp Gin Arg Gly Gin Val Ala Ile Leu Thr Asp Thr Pro Glv Ara 

520 525 



515 



His Lys Met Ser Ile Met Pro Ser Leu Asp Ala Ser Pro Glu Ser Hi< 
530 535 540 
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He Ser Leu Ser 
545 

Gly Lys Ser Glu 



Leu Val Val Ala 
580 

Lys Gin Gly Asp 
595 

Ala Leu Asp Glu 
610 

His Asp Asp His 
625 

Gin Gin His Ala 



Trp Asn Leu Thr 
660 

His Thr Asn Pro 
675 

Ser Leu Ala Leu 
690 

Lys Gly Trp Thr 
705 

Leu Asp Gly Ala 



Asn He Asn Gin 
740 

Phe Ser Leu Pro 
755 

Gin Gly Leu Asn 
770 

Val Asn Lys Tyr 
785 

Gin He Lys Pro 



Ser Arg Glu Gly 
820 

Lys Gin Asn Leu 

835 

Pro Arg Glu Ala 
850 



Leu His Phe Ala 
550 

Leu Glu Ala Gin 
565 

Asp Ser Glu Gly 



Gly Asn Glu Leu 
600 

His Phe Gly His 
615 

Gly Gin Leu Asn 

630 

Cys Pro Leu Gly 
645 

Asp Ala Leu Val 



Glu Pro His Glu 
680 

Gin Glu Gly Lys 
695 

Gly Ala Glu Ser 
710 

Ala Tyr Leu Leu 
725 

Ser Thr Ser Ser 



His Val Arg Asn 
760 

Lys Asp Asp Lys 
775 

Leu Ala Leu Thr 
790 

Gly Thr Gin Gin 
805 

lie Ser Gly Glu 



Tyr Ala Leu Thr 
840 

Trp Gin Asn Gly 
855 



Asp Ala His Gin 
555 

Ser Val Ala He 
570 

Lys Leu Phe Ser 
585 

Lys Met Lys Ala 



Asp His Gin He 
620 

Ala Leu Val Lys 

635 

Asn Asp His Gin 
650 

He Asp Asn Gin 
665 

He Leu Asp Met 



Leu His Tyr Phe 
700 

Asp Cys Lys Gin 
715 

Lys Asp Gly Glu 
730 

He Lys His Gly 
745 

Lys Pro Glu Pro 



Ala Gin Ala Met 
780 

Glu Lys Gly Asp 

795 

Leu Glu Arg Pro 
810 

Leu Lys Asp He 
825 

His Glu Gly Glu 



Ala Glu Ser Ser 
860 



Gly Leu Leu His 
560 

Ser His Gly Arg 
575 

Ala Ala He Pro 
590 

Met Pro Gin His 
605 

Ser Gly Phe Phe 



Asn Asn Phe Arg 
640 

Phe His Pro Gly 
655 

Leu Gly Leu His 
670 

Gly His Leu Gly 

685 

Asp Gin Leu Thr 



Leu Lys Lys Gly 
720 

Val Lys Arg Leu 
735 

Thr Glu Asn Val 
750 

Gly Asp Ala Leu 
765 

Ala Val He Gly 



He Arg Ser Phe 
800 

Ala Gin Thr Leu 
815 

His Val Asp His 
83 0 

Val Phe His Gin 
845 

Ser Trp His Lys 
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Leu Ala Leu Pro Gin Ser Glu Ser Lys Leu Lys Ser Leu Asp Met Ser 
865 870 875 880 

His Glu His Lys Pro lie Ala Thr Phe Glu Asp Gly Ser Gin His Gin 
885 890 895 

Leu Lys Ala Gly Gly Trp His Ala Tyr Ala Ala Pro Glu Arg Gly Pro 
900 905 910 

Leu Ala Val Gly Thr Ser Gly Ser Gin Thr Val Phe Asn Arg Leu Met 
915 920 925 

Gin Gly Val Lys Gly Lys Val He Pro Gly Ser Gly Leu Thr Val Lys 
930 935 940 

Leu Ser Ala Gin Thr Gly Gly Met Thr Gly Ala Glu Gly Arg Lys Val 
945 9 50 955 960 

Ser Ser Lys Phe Ser Glu Arg He Arg Ala Tyr Ala Phe Asn Pro Thr 
965 970 975 

Met Ser Thr Pro Arg Pro He Lys Asn Ala Ala Tyr Ala Thr Gin His 
980 985 990 

Gly Trp Gin Gly Arg Glu Gly Leu Lys Pro Leu Tyr Glu Met Gin Gly 
995 1000 1005 

Ala Leu He Lys Gin Leu Asp Ala His Asn Val Arg His Asn Ala Pro 
1010 1015 1020 

Gin Pro Asp Leu Gin Ser Lys Leu Glu Thr Leu Asp Leu Gly Glu His 
1025 1030 1035 1040 

Gly Ala Glu Leu Leu Asn Asp Met Lys Arg Phe Arg Asp Glu Leu Glu 
1045 1050 1055 

Gin Ser Ala Thr Arg Ser Val Thr Val Leu Gly Gin His Gin Gly Val 
1060 1065 1070 

Leu Lys Ser Asn Gly Glu He Asn Ser Glu Phe Lys Pro Ser Pro Gly 
1075 1080 1085 

Lys Ala Leu Val Gin Ser Phe Asn Val Asn Arg Ser Gly Gin Asp Leu 
1090 1095 lioo 

Ser Lys Ser Leu Gin Gin Ala Val His Ala Thr Pro Pro Ser Ala Glu 
1105 mo . ins 1120 

Ser Lys Leu Gin Ser Met Leu Gly His Phe Val Ser Ala Gly Val Asp 
1125 H30 1135 

Met Ser His Gin Lys Gly Glu He Pro Leu Gly Arg Gin Arg Asp Pro 
1140 H45 1150 

Asn Asp Lys Thr Ala Leu Thr Lys Ser Arg Leu He Leu Asp Thr Val 
1155 H60 H65 

Thr He Gly Glu Leu His Glu Leu Ala Asp Lys Ala Lys Leu Val Ser 
1170 H75 H80 



Asp His Lys Pro Asp 
1185 



Ala Asp Gin He Lys Gin Leu Arg Gin Gin 
H90 H95 



Phe 
1200 



• 
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Asp Thr Leu Arg Glu Lys Arg Tyr Glu Ser Asn Pro Val Lys His Tyr 
1205 1210 1215 

Thr Asp Met Gly Phe Thr His Asn Lys Ala Leu Glu Ala Asn Tyr Asp 
1220 1225 1230 

Ala Val Lys Ala Phe He Asn Ala Phe Lys Lys Glu His His Gly Val 
1235 1240 1245 

Asn Leu Thr Thr Arg Thr Val Leu Glu Ser Gin Gly Ser Ala Glu Leu 
1250 1255 1260 

Ala Lys Lys Leu Lys Asn Thr Leu Leu Ser Leu Asp Ser Gly Glu Ser 
1265 1270 1275 1280 

Met Ser Phe Ser Arg Ser Tyr Gly Gly Gly Val Ser Thr Val Phe Val 
1285 1290 1295 

Pro Thr Leu Ser Lys Lys Val Pro Val Pro Val He Pro Gly Ala Gly 
1300 I 305 1310 

He Thr Leu Asp Arg Ala Tyr Asn Leu Ser Phe Ser Arg Thr Ser Gly 
1315 1320 1325 

Gly Leu Asn Val Ser Phe Gly Arg Asp Gly Gly Val Ser Gly Asn He 
1330 1335 1340 

Met Val Ala Thr Gly His Asp Val Met Pro Tyr Met Thr Gly Lys Lys 
1345 13 50 1355 

Thr Ser Ala Gly Asn Ala Ser Asp Trp Leu Ser Ala Lys His Lys lie 
1365 1370 1375 

Ser Pro Asp Leu Arg He Gly Ala Ala Val Ser Gly Thr Leu Gin Gly 
1380 1385 1390 

Thr Leu Gin Asn Ser Leu Lys Phe Lys Leu Thr Glu Asp Glu Leu Pro 
1395 I 400 1405 

Gly Phe lie His Gly Leu Thr His Gly Thr Leu Thr Pro Ala Glu Leu 

1410 1415 1420 

Leu Gin Lys Gly He Glu His Gin Met Lys Gin Gly Ser Lys Leu Thr 
1425 I" 0 1435 

Phe Ser Val Asp Thr Ser Ala Asn Leu Asp Leu Arg Ala Gly lie Asn 
1445 1450 1455 

Leu Asn Glu Asp Gly Ser Lys Pro Asn Gly Val Thr Ala Arg Val Ser 
1460 1465 1470 

Ala Gly Leu Ser Ala Ser Ala Asn Leu Ala Ala Gly Ser Arg Glu Arg 

1475 I 480 • 1485 

Ser Thr Thr Ser Gly Gin Phe Gly Ser Thr Thr Ser Ala Ser Asn Asn 

1490 I 495 1500 

Arg Pro Thr Phe Leu Asn Gly Val Gly Ala Gly Ala Asn Leu Thr Ala 
1505 1510 ^ 
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Ala Leu Gly Val Ala His Ser Ser Thr His Glu Gly Lys Pro Val Gly 
1525 1530 1535 

lie Phe Pro Ala Phe Thr Ser Thr Asn Val Ser Ala Ala Leu Ala Leu 
1540 1545 1550 

Asp Asn Arg Thr Ser Gin Ser He Ser Leu Glu Leu Lys Arg Ala Glu 
1555 1560 1565 

Pro Val Thr Ser Asn Asp He Ser Glu Leu Thr Ser Thr Leu Gly Lys 
1570 1575 1580 

His Phe Lys Asp Ser Ala Thr Thr Lys Met Leu Ala Ala Leu Lys Glu 
1585 1590 1595 1600 

Leu Asp Asp Ala Lys Pro Ala Glu Gin Leu His He Leu Gin Gin His 
1605 1610 1615 

Phe Ser Ala Lys Asp Val Val Gly Asp Glu Arg Tyr Glu Ala Val Arg 
1620 1625 1630 

Asn Leu Lys Lys Leu Val He Arg Gin Gin Ala Ala Asp Ser His Ser 
1635 1640 1645 

Met Glu Leu Gly Ser Ala Ser His Ser Thr Thr Tyr Asn Asn Leu Ser 
1650 1655 1660 

Arg He Asn Asn Asp Gly He Val Glu Leu Leu His Lys His Phe Asp 
1665 1670 1675 1680 

Ala Ala Leu Pro Ala Ser Ser Ala Lys Arg Leu Gly Glu Met Met Asn 
1685 1690 1695 

Asn Asp Pro Ala Leu Lys Asp He He Lys Gin Leu Gin Ser Thr Pro 
1700 1705 1710 

Phe Ser Ser Ala Ser Val Ser Met Glu Leu Lys Asp Gly Leu Arg Glu 
1715 1720 1725 

Gin Thr Glu Lys Ala He Leu Asp Gly Lys Val Gly Arg Glu Glu Val 
1730 1735 1740 

Gly Val Leu Phe Gin Asp Arg Asn Asn Leu Arg Val Lys Ser Val Ser 
1745 1750 1755 1760 

Val Ser Gin Ser Val Ser Lys Ser Glu Gly Phe Asn Thr Pro Ala Leu 
1765 1770 1775 

Leu Leu Gly Thr Ser Asn Ser Ala Ala Met Ser Met Glu Arg Asn He 
1780 1785 1790 

Gly Thr He Asn Phe Lys Tyr Gly Gin Asp Gin Asn Thr Pro Arg Arg 
1795 1800 1805 

Phe Thr Leu Glu Gly Gly He Ala Gin Ala Asn Pro Gin Val Ala Ser 
1810 1815 1820 



Ala Leu Thr Asp Leu Lys Lys Glu Gly Leu Glu Met Lys Ser 
1825 1830 1835 



-57- 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



CGGGTGTAAA 
TGAGGAGGCG 
AATCATTGAG 
TTTTGAAATG 
TTTATGTTTT 
CGGCTTCATC 
CGCGGCATAA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ATGACATCGT CACAGCAGCG GGTTGAAAGG TTTTTACAGT ATTTCTCCGC 
ACGCCCATAC ATCTGAAAGA CGGGGTGTGC GCCCTGTATA ACGAACAAGA 
GCGGTGCTGG AAGTACCGCA ACACAGCGAC AGCCTGTTAC TACACTGCCG 
GCTGACCCAC AAACTTCAAT AACCCTGTAT TCGATGCTAT TACAGCTGAA 
GCGGCCATGC GCGGCTGTTG GCTGGCGCTG GATGAACTGC ACAACGTGCG 
CAGCAGTCGC TGGAGCATCT GGATGAAGCA AGTTTTAGCG ATATCGTTAG 
GAACATGCGG CAGAAGTG CG TGAGTATATA GCGCAATTAG A CG AG AG TAG 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Thr Ser Ser Gin Gin Arg Val Glu Arg Phe Leu Gin Tyr Phe 



10 



Ser 



15 



Ala Gly Cys Lys Thr Pro He His Leu Lys Asp Gly Val 



25 



Cys Ala Leu 
30 

Tyr Asn Glu Gin Asp Glu Glu Ala Ala Val Leu Glu Val Pro Gin His 
35 40 45 

Ser Asp ser Leu Leu Leu His Cys Arg He lie Glu Ala Asp Pro Gin 

55 60 

Thr Ser He Thr Leu Tyr Ser Met Leu Leu Gin Leu Asn Phe Glu Met 

70 7c 

/b 80 

Ala Ala Met Arg Gly Cys Trp Leu Ala Leu Asp Glu Leu His Asn Val 



60 
120 
180 
240 
300 
360 
420 



90 



95 
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Arg Leu Cys Phe Gin Gin Ser Leu Glu His Leu Asp Glu Ala Ser Phe 
100 105 



110 



Ser Asp He Val Ser Gly Phe He Glu Hi 



115 



120 



s Ala Ala Glu Val Arg Glu 



125 



Tyr He Ala Gin Leu Asp Glu Ser Ser Ala Ala 
130 135 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
GGAACCNNNN NNNNNNNNNN NCAACATAA 



29 



